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Abstract 



This thesis establishes new quantitative records in several problems of incidence 
geometry and growth. After the necessary background in Chapters 1, 2 and 3, 
the following results are proven. 

Chapter 4 gives new results in the incidence geometry of a plane determined 
by a finite field of prime order. These comprise a new upper bound on the total 
number of incidences determined by finitely many points and lines, and a new 
estimate for the number of distinct lines determined by a finite set of non-collinear 
points. 

Chapter 5 gives new results on expander functions. First, a new bound is 
established for the two- variable expander a + ab over a finite field of prime order. 
Second, new expanders in three and four variables are demonstrated over the 
real and complex numbers with stronger growth properties than any functions 
previously considered. 

Finally, Chapter 6 gives the first bespoke sum-product estimate over function 
fields, a setting that has so far been largely unexplored for these kinds of problems. 
This last chapter is joint work with Thomas Bloom. 
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Standard notation 



The following standard conventions are used. 

• Write Y = 0{X), Y < X, or X = Q(Y) if there is a fixed constant C such 
that Y < CX. The constant C is referred to as the 'implicit constant'. 

• Using this notation, we will often use the fact that X <C Y + Z if and only 
if X < max{y,Z}. 

• If X < y and y < X then write X « y or Y = 0(X). 

• If the implicit constant depends on some parameter A then this is reflected 
with a subscript, e.g. Y = 0\(X) or Y <Ca X. 

• Write y < X 1+ °W, y < X, Y = 6{X) or X = £l(Y) if y < e X 1+e for all 
e > 0. In particular, this notation is used when 7 <C I (logX) a for some 
a > 0. 

• Write 1 for an indicator function, that is if V is a proposition then 1 (V) is 
equal to 1 if V is true and is zero otherwise. 

• The letters M, C, N, Z have their usual meaning. By ¥ p we mean a finite 
field of prime order p, and by ¥ q a finite field of order q, where q = p a is 
a prime power. The letters p and q will also be used to denote points in a 
plane or higher- dimensional vector space, but the context will ensure that 
no confusion arises. 
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Standard notation 



• For subsets A, B of a field, write A + B = {a + b : a E A,b E B} and 
AB = {ab : a E A,b E B} for the sumset and product set of A and B. 
The difference set A — B and ratio set A/ B are defined analogously. 

• If x is an element and A a subset of a field then write A+x for the translation 
A + {x} and xA for the dilation {x} A. 

• If k is a natural number and A is a subset of a field then write kA for the 
iterated sumset kA = A + . . . + A . Context will ensure that no confusion 

k 

between a dilation and an iterated sumset of A arises. 
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Prologue 



'And if you take one from three hundred and sixty-five what remains?' 
'Three hundred and sixty-four, of course. ' 

Humpty Dumpty looked doubtful, 'I'd rather see that done on paper, ' he said. 

Lewis Carroll, Through the Looking-Glass 

This thesis sets new records at the interface of two areas of mathematics: 
incidence geometry and growth. The first of these, incidence geometry, is about 
points and lines in a plane and incidences between them; a point is 'incident' to 
a line if it lies on that line. The following sorts of problem are typical: 

Incidence bounds. Incidences are counted with multiplicity, in the sense 
that if several lines cross at a single point then we count several incidences. 
In total, how many incidences could there be between a finite set of points 
and a finite set of lines? 

Line counting. A pair of distinct points determines a line. In total, how 
many distinct lines might be determined by pairs from a finite set of points? 

There are many natural generalisations, for example to higher dimensional vector 
spaces instead of planes, to curves instead of lines, and so forth. 

The second area, growth, is about how much bigger a finite subset of a field 
becomes after passing to its image under some function of two or more variables. 
Examples are: 
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Prologue 



Sum-product estimates. For a set A of given cardinality, how much 
bigger must the quantity max{\A + A\, \AA\} be? Here, A + A and A A are 
respectively the sets of pairwise sums and products of elements of A. 

Expander functions. Let F be the ambient field and n > 2 be an in- 
teger. What functions f : F n — > F are there for which the set f{A) = 
{/(fli, • • • , a n ) : di G A} is always much bigger than a finite subset A of F? 



There is a common theme to all of this: estimating how much regimentation 
can be forced onto a finite set. For example, on the incidence geometry side the 
existence of too many incidences corresponds to the existence of extremely struc- 
tured sets of points and lines. Understanding incidence bounds means placing a 
cap on such structure. As an example on the growth side, the only way in which 
the sumset A + A or the product set AA can be small is if the elements of A are 
arranged respectively in some kind of additive or multiplicative 'conspiracy', such 
as an arithmetic or geometric progression. Understanding sum-product estimates 
therefore means working out how far such conspiracies could coincide. 

There are applications of this theme, and of the specific problems above, to 
theoretical computer science and cryptography, where they can be used to obtain 
rigorous estimates of pseudorandom behaviour. However these are not pursued 
here, the motivation instead being for progress on the problems in their own right. 
The only exception to this approach of not worrying about applications is the 
interaction between the problems themselves. Incidence results have applications 
to growth, and growth results have applications to incidences, and this is one of 
the things that will contribute to progress on both. 

When it comes to results the philosophy is one of 'hard analysis', placing 
the emphasis on finding explicit bounds and making them as strong as possible. 
Quite how strong this might be depends very much on the choice of underlying 
field. Historically the attention was on real numbers. There are powerful results 
in this setting, which is still extremely active. With work, many incidence and 
growth results extend verbatim from the real to complex settings. More recently 
a lot of research has focused on finite fields^. This thesis presents work in all of 



1 Recall that the finite field F p of prime order p is simply the set of residues modulo p under 
addition and multiplication, and that the finite field ¥ q of prime power order q = p a is a degree 
a algebraic extension of F p . 
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Prologue 



these areas, and also breaks new ground in the relatively unexplored setting of 
function fields^. 

Some remarks on finite fields 

Much, but by no means all, of the work in this thesis concerns finite fields, and it 
is worth pausing to explain a couple of considerations versus the real and complex 
settings. There is also a slightly different set of considerations in the study of 
function fields, but these are deferred to Chapter since that is the only place 
they arise. 

There are two main complications in finite fields. The first is the existence 
of finite subfields, which must be ruled out before anything nontrivial can be 
said. In the case of the field ¥ p of prime order p, this collapses into a cardinality 
condition since there are no proper subfields. For this reason much finite field 
research focuses on the prime order case; the hoops that one must jump through 
are similar to the general case, but the mathematics is usually cleaner. 

The second complication is an issue of tools and difficulty. Many methods 
in the real and complex settings depend critically on their topologies and so do 
not extend to finite fields. This makes it a lot harder to prove things and forces 
a more combinatorial approach, which is a mixed blessing. The bad news is 
that quantitative results are usually not as strong. But the good news is that 
methods and results achieved in finite fields usually extend elsewhere without 
difficulty Thus finite field results carry a certain amount of moral authority as 
they correspond to the worst possible cases. 

Although not considered here in detail, it is worth remarking that there is an 
additional 'large set' paradigm for finite field results, with extensive literature. 
This features strong results for sets satisfying an additional minimum density 
condition, typically at least a square-root barrier. However the methods have a 
very different flavour, drawing for example on estimates for exponential sums. 
On top of this, they cannot be so easily exported to other settings; the interest 
here is in finite sets, but a minimum density condition in finite fields is usually 
analogous to a requirement for infinite sets elsewhere. 

2 The function field ¥ q (t) is the field of rational functions over the finite field ¥ q 
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Structure of the thesis 

The first three chapters build up background and preliminaries. 

• Chapters Q] and [2] introduce incidence geometry and growth respectively 
in more detail. They also record particular incidence and growth results for 
use in later chapters. 

• Chapter [3]is a handbook of results in sumset calculus, which is a workhorse 
of much of the later mathematics. 

The subsequent chapters are concerned with original results, developing state 
of the art incidence geometry and growth in several directions. Between them, 
there are new results on all four problems from page [TSJ incidence bounds, line 
counting, sum-product estimates and expander functions. Moreover they encom- 
pass real, complex and finite field settings, and also break new ground in the 
relatively unexplored function field setting. 

• Chapter |4] considers incidence geometry over finite fields and sets two new 
records. 

* The first is a new incidence bound. If P and L are a set of points and 
lines respectively in F^ with \P\, \L\ < N then, writing I(P, L) for the 
number of incidences between P and L, non-trivial bounds are of the 
form I(P, L) < iVi- £ for e > 0. 

A new bound of e > ^ — o(l) is established, holding whenever N < p. 
This improves by an order of magnitude on the previous bound of 

c — 10,678' 

* The second is a new line counting result. If P is a set of points in F^ 
with \P\ < p then either fidPl 1-0 ^) of the points are collinear, or P 
determines at least Q f |P| 1+ t§3~°W j distinct lines. 

This improves on previous results in two ways. Quantitatively, the 
exponent is stronger than the previously best-known 1 + And 
qualitatively, the result applies to all subsets of F^ satisfying the car- 
dinality condition; the previously best-known result applies only for P 
of the form P = A x A for A C F„. 
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• Chapter sets three new records for expander functions. 

* First, there is a result on two- variable expanders in finite fields. If 
/(a, b) = a(b + 1) then \ f(A)\ > |A| 1+ ^~ o(1) whenever A is a subset 
of F p with | A | < p 1 / 2 . This improves on the previous best-known 
exponent of 1 + ^ — o(l). 

* Second, there is a result on three- variable expanders for complex num- 
bers. The function g(a,b,c) = which has not been previously 
considered in this context, is shown to satisfy |g(^4)| 3> |A| 2_ °( 1 ' ) for 
any finite A C C. Previously- known functions with this property were 
all of four variables and applied only to sets of real numbers. 

* Third, there is a result on four-variable expanders for real numbers. 
The function h(a,b,c,d) = fegjfe^l > again not previously considered 
in this context, is shown to satisfy ^> \A\ 2 whenever A is a 
finite set of real numbers. The previously best-obtained bound for a 
function of four variables was j^l 2-0 ^. 

• Chapter [H] is joint work with Thomas Bloom and breaks new ground by 
establishing a sum-product estimate in the function field F 9 (t), a relatively 
unexplored setting for this kind of work. Function fields form an interesting 
intermediate case between the finite field and real and complex settings since 
they have an unusually rigid 'non-archimedean' topology. 

A sum-product estimate of 

max{\A + A\,\AA\} > g 

is established for any finite subset A of W g (t). The exponent of 1 + | — o(l) 
lies between the 1 + — o(l) known for finite fields and the 1 + | — o(l) 
known for real and complex numbers. 

There are two appendices, which summarise standard background informa- 
tion. 

• Appendix [X] covers standard pigeonholing results used throughout the 
thesis. The phrases 'by averaging', 'by Cauchy-Schwarz' and 'by dyadic 
pigeonholing' are deployed frequently and implicitly refer to results from 
here. 
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Prologue 

• Appendix [B] covers material on projective geometry necessary for Chapter 
H]and Chapter [5j Additional standard material on the projective theory of 
cross ratios is covered at an appropriate point in Chapter [5j 
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1 



Incidence geometry 



The prologue mentioned two areas of incidence geometry: incidence bounds and 
line counting. This chapter introduces them properly, with an emphasis on the 
real and complex settings, the case of finite fields being deferred to Chapter HJ 
In so doing it also records results which will be of use in Chapter |5j 

Incidence bounds are covered first, establishing a trivial estimate which holds 
regardless of the underlying field. Non-trivial incidence bounds depend on the 
setting; the best-understood case is the plane M. 2 , where the classical Szemeredi- 
Trotter theorem holds. Line counting is then tackled in M 2 , establishing Beck's 
theorem as a consequence of Szemeredi- Trotter. 

Three ways of generalising to higher-dimensions are also considered, as is a 
generalisation from the real to complex setting. 

Apart from Theorem II. 8[ which is a straightforward variation of existing re- 
sults, all of the mathematics in this chapter is drawn from the literature. 

1.1 Incidence bounds in a plane 

A plane is simply F 2 where F is a field. A point is an element of F 2 and a line 
is the set of points (x, y) G F 2 satisfying an equation 

ax + by + c = 

for fixed a,b,c G F that are not all zero. A point p is incident to a line I Hp El. 
If P is a finite set of points in a plane, and L is a finite set of lines, then write 
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Chapter 1. Incidence geometry 



I(P, L) for the number of incidences between points in p and lines in I, that is 



peP leL 



where 

l, if p e I 
' pl S 0, ifp<£L 



Incidences are therefore counted with multiplicity, as illustrated in Figure 11.11 




(a) One incidence (b) Two incidences 




(c) Five incidences 
Figure 1.1. Counting incidences 

It is worth remarking (see Appendix [B]) that a 'plane' could instead be taken 
to be the projective plane FF 2 containing F 2 , and 'lines' to be projective lines 
within it. Indeed, all the results in this chapter extend without difficulty to 
projective space. But since no properties of the projective setting will be used, it 
makes sense to stay with the affine one. Chapters H] and make active use of the 
properties of projective space, and we will cross that bridge when we come to it. 

An important topic in incidence geometry is the study of upper bounds on 
I(P, L) in terms of \P\ and \L\, referred to as incidence bounds. When consid- 
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1.1. Incidence bounds in a plane 



ering these, the most straightforward observation to make is that each point in 
P is incident to at most \L\ lines in L, implying that 

I(P,L) < \P\\L\. 

However it is immediate from the definition of points and lines that two lines in 
L are simultaneously incident to at most one point in P, and two points in P 
are simultaneously incident to at most one line in L. This fact, combined with 
the Cauchy-Schwarz inequality, leads to the following somewhat better estimate, 
which is nonetheless referred to as trivial. 

Lemma 1.1 (Trivial incidence bound). If P is a set of points in a plane, and L 
is a set of lines, then 

I(P,L) « min{|P| + |P| 1/2 |£|, \L\ + |L| 1/2 |P|} . 

Proof. Cauchy-Schwarz and fll.ll) imply that 

W£) 2 =(££v) 

\peP leL J 

< \ p \J2 E (1.2) 

peP h,heL 

Split the summation over lines I1J2 G L according to whether or not l\ and I2 
are distinct, so that 

E E Wb^EE^ + E E 8 Ph 6 Ph 
pePh,h&L peP leL P eP h^heL 

= i(p,l) + j2 E 

peP h^heL 

Substituting into (jl.2p yields 

I(P,L) 2 <\P\I(P,L) + \P\ E^A^- (1.3) 

h^heL peP 

Two distinct lines are simultaneously incident to at most one point, and so 

peP 
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Chapter 1. Incidence geometry 



whenever li ^ l 2 , since this is the number of points in P incident to both li and 
/ 2 . Hence from (II. 3p . 

I(P,L) 2 < \P\I(P,L) + |P||L| 2 . 

So either I(P,L) 2 < \P\I(P,L) in which case I(P, L) < |P|, or I(P,L) 2 < 
| P 1 1 L | 2 in which case 7(P,L) < |P| 1/2 |L|. Overall therefore 

J(P,L) < |P| + |P| 1/2 |L|. 

The proof that I(P, L) <C \L\ + |iy| 1 / 2 (P'| is the same, except that the roles of 
points and lines are reversed. □ 

It is often helpful to consider the 'critical' case where P and L have the same 
cardinality, say |P| = \L\ = N. In this case, the trivial bound from Lemma [1J] is 

/(P, L) < N 3/2 

and so non-trivial bounds will be of the form 

J(P, L) < N 3/2 - e 

for e > 0. 

The following standard result shows that best non-trivial bound that can be 
hoped for is e = |, that is /(P, L) < N A ^ . 

Lemma 1.2 (Constraint on incidence bounds). Let F be a field. If F has char- 
acteristic then for any positive integer N there is a set P of points in F 2 and 
a set L of lines, with |P|, \L\ w iV such that 

I(P, L) ss N 4/3 . 

If the characteristic of F is p > then the same result holds, but with the con- 
straint that N must be less than (|) 3 ^ 2 . 

Proof. Write l rs for the line given by y = rx + s. In the zero-characteristic case 
take 

P = {( x ,y) E Z 2 : 1 < x < N 1/3 ,l <y< 2N 2/3 } 
L = {l rs : (r, s) E Z 2 : 1 < r < iV 1/3 , 1 < s < N 2/3 } . 
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1.2. The Szemeredi- Trotter theorem 



There are (N) distinct points in P and G (N) distinct lines in L, and it is easy 
to check that each line in L is incident to (iV 1 / 3 ) points in P, meaning that 
there are Q(N 4 ^ 3 ) incidences. 

The characteristic p > case is the same; simply replace Z 2 in the construction 
with F 2 . The constraint that iV < (f) 3//2 ensures that all the points and lines are 
distinct. □ 



1.2 The Szemeredi- Trotter theorem 

What can be said about non-trivial incidence bounds? The answer depends 
on the underlying field over which the plane is defined. This section considers 
the classical case of the plane K 2 where Szemeredi and Trotter |46j proved the 
following flagship result of incidence geometry. 

Theorem 1.3 (Szemeredi, Trotter). If P is a set of points in M? and L is a set 

of lines, then 

I(P, L) < |P| 2/3 |L| 2/3 + \P\ + \L\. 

In the critical case this gives I(P, L) <C iV 4//3 , which is sharp up to the implicit 
constant by Lemma 11.21 Before proving Szemeredi- Trotter it is worth recording 
an immediate consequence which is often useful in applications to other problems. 

Corollary 1.4. Let L be a set of lines in M 2 . Then the number of points incident 
to at least k lines in L is O ( ^p- + ) . Similarly, the number of lines incident 



to at least k points in a point set P is O ^^p- + 
Proof. Let be the set of points incident to at least k lines in L. Then 

\P k \k<I(P k ,L). 
On the other hand, the Szemeredi- Trotter theorem shows that 

I(P k ,L) < |P fe | 2 / 3 |L| 2 / 3 + |L| 
and so comparing upper and lower bounds on I(P k , L) gives 

, , \L\ 2 \L\ 

as required. The proof for the number of lines incident to at least k points is 
similar. □ 
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Chapter 1. Incidence geometry 



Now for a proof of the Szemeredi- Trotter theorem. There have been several 
proofs since Szemeredi and Trotter's original. The one here is due to Szekely [45J. 
A third proof, using different techniques again, can be found in a recent paper of 
Kaplan, Matousek and Sharir [27J. 

Proof of Theorem \1.0A Without loss of generality assume that every point is in- 
cident to at least one line, and every line is incident to at least one point. Recall 
that a graph G(V, E) consists of a set V of vertices, and a set E of unordered 
pairs of vertices, called edges. 

Construct a graph G(V, E) as follows. Take the set V of vertices to be the set 
of points P. To construct the set E of edges, say that {pi,P2) is an edge if and 
only if pi and P2 are adjacent along a line in L, in the sense that the line segment 
connecting them is contained in a line from L and contains no other points from 



For each I 6 L write k(l) = ^2 peP S P i for the number of points p G P that are 
incident to L. Note that I contains k(l) — 1 edges from E, as illustrated in Figure 



P. 




Figure 1.2. k collinear points determine k — 1 edges. 



It follows that 



£i = 5>(0-i) 




l&L p&P 



I(P,L)-\L 



(1.4) 
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1.2. The Szemeredi- Trotter theorem 



We now apply a graph-theoretic result. A drawing of G = G(V, E) is an 
identification of each vertex in V with a distinct point in M 2 , and of each edge 
(it, v) G E with a curve connecting u and v. A crossing occurs whenever two 
curves corresponding to edges intersect away from a vertex. Write cross(G) for 
the minimum number of crossings in a drawing of G. We will employ the following 
lemma; see Chapter 8 of the book of Tao and Vu [48J for a proof. 

Lemma 1.5 (Crossing number lemma). Let G(V,E) be a graph with \E\ > 4|V|. 
Then cross{G) > 

Applying the crossing number lemma and (11.41) shows that at least one of the 
following bounds holds 

I(P, L) <C \P\ + \L\ (1.5) 
j(p r A3 

cross(G) » l |p|2 J . (1.6) 

Let's consider what happens when (11.61) holds. It is clear that 

cross(G) < \L\ 2 (1.7) 

since for two edges to cross it is necessary that two lines cross. Comparing the 
bounds fll.6p and ( II. 7p yields 

^^«cross(G)<|L| 2 
r I 

and hence 

I(P,L) < |P| 2/3 |L| 2/3 (1.8) 
whenever ( II. 6p holds. Thus either ( II. 5p or ( II. 8p holds, implying overall that 

I(P, L) < |P| 2/3 |L| 2/3 + \P\ + \L\ 

as required. □ 

The key to the above proof is the crossing number inequality. Beyond this the 
only properties used are the trivial facts that any two lines are simultaneously 
incident to at most one point, and any two points are simultaneously incident 
to at most one line. With a few modificationgj the same argument goes through 



1 See for example Theorem 8.10 of the book of Tao and Vu 
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Chapter 1. Incidence geometry 

when L is a set of curves rather than lines, such that any two points in P are 
simultaneously incident to at most a curves and any two curves in L are simulta- 
neously incident to at most /3 points in P. The implicit constant in the statement 
then depends on a and f3 and so the theorem becomes 

I(P,L) < tt)j9 |P| 2 / 3 |L| 2 / 3 + \P\ + \L\. 

The required changes are to replace the crossing number lemma for graphs with 
one for multigraphs, and to replace ( II. 7p with 

cross(G) < f3\L\ 2 . 

Pach and Sharir [35] took this approach further, to encompass the case where 
any k points, rather than any two points, are simultaneously incident to at most 
a curves. Note that the condition on curves remains the same; any two curves 
are simultaneously incident to at most points, rather than any k. 

Theorem 1.6 (Pach, Sharir). Let P be a set of points in M. 2 and L be a set 

of curves, such that any k points in P are simultaneously incident to at most a 
curves in L and any two curves in L are simultaneously incident to at most (3 
points in P. Then 

k 2k — 2 

I(P,L) < a ,0 \P\ — \L\—i + \P\ + \L\. 

1.3 Line counting and Beck's theorem 

This section gives an application of the Szemeredi- Trotter theorem to the line 
counting problem. Two distinct points in M? determine a line, and so for a set 
P of points we can consider the set L(P) of lines determined by pairs of points in 
P. This could have just one element, which would happen if all the points in P 
lie along a single line as in Figure [L3l (a). Or it could be as large as ('^') ~ \P\ 2 
if the points are in general position as in Figure 11.31 (b). 

The following theorem of Beck [3] asserts that, up to multiplicative constants, 
these two extremes are essentially the only two possibilities. 

Theorem 1.7 (Beck). If P is a set of points in M 2 then at least one of the 
following holds 
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(a) L(P) has only a single element 



(b) L(P) has ('2') elements 



Figure 1.3. The two extremes for counting lines. 



1. At least Q (\P\) points in P are incident to a single line. 



2. \L{P)\k\P 



Beck's theorem was originally published in the same journal edition as the 
Szemeredi- Trotter theorem. But it is in fact also a consequence of Szemeredi- 
Trotter. This latter proof, given here, is more straightforward and can be found 
for example as exercise 8.2.6 in the book of Tao and Vu [48J. 

Proof of Theorem \1.7\ For each I 6 L(P) write f/,(l) = J2 P &p^pi f° r ^ ne number 
of points p G P that are incident to I. It is clear that 



since the right-hand side counts pairs of distinct points in P incident to I. It 
follows that 



For fixed p\ 7^ P2 there is precisely one line in L(P) that is incident to p\ and P2, 

so 






l£L(P) 




leL(P) 

and hence by combining the above equations 




(1.9) 



l£L(P) 
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Now for each integer j let Lj be the set of I G L(P) for which 2 3 < fi(l) < 2 3+1 . 
Corollary 11.41 implies that 

\P\ 2 \P\ 

For points p, q G P write for the line determined by p and g. For any constant 
C, let Xc be the set of pairs of distinct points p,q G P for which 



C < Kin) < 

From f Tl~T0|) . 



|P| 



log 2 (|P|/C) 

|Jf |« E |L,|2 2 ^ 

j=log 2 C 

log 2 (|P|/C) / 2 . .. 

« E (w + ^) 22j (!■") 



i=iog 2 c 

Summing geometric series yields 

log 2 (|P|/C) 

=log 2 C 
log 2 (|P|/C) 

c 



23 C 

j=log 2 C 

1 ipi 



j=log 2 C 

and so (II. lip implies 



IP 



2 



\X C \ « (1.12) 

Comparing (II. 9p and fl 1 . 1 2 [) it is possible to pick a constant C sufficiently large 
that 

\xc\ < \ E MO 2 - 



2 



Thus for this fixed C, either 



E MO 2 » \p\ 2 - 

lEL(P):n(l)>\P\/C 

or 

E » i^i 2 

ieL(py.n(i)<c 
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In the former case there are f2(|P| 2 ) pairs (p, q) for which n(l pq ) > \P\/C, and 
so in particular there is at least one line in L(P) that is incident to fi(|-P|) points. 
This corresponds to the first case in the conclusion of the theorem. 

In the latter case there are f2(|P| 2 ) pairs of points p, g G P for which fi(l pq ) < 
C, meaning that there are f2(|P| 2 ) distinct lines in L(P). This corresponds to 
the second case in the conclusion of the theorem. □ 

Since it is a consequence of Szemeredi- Trotter, which extends to points and 
curves, the statement of Beck's theorem generalises in the same way. This can be 
further developed by using the Pach-Sharir theorem in place of Szemeredi- Trotter, 
for example to the following result. 

Theorem 1.8. For finite P C M?, write C(P) for the set of circles determined 
by non-collinear triples of points from P. For any P , at least one of the following 
holds: 

1. There are Vl{\P\) cocircular points in P. 

2. There are Vt{\P\) collinear points. 

3. \C{P)\ > |P| 3 . 

The details of the proof are similar to that of Beck's theorem, using instead 
the k = 3 case of the Pach-Sharir theorem and the fact that three distinct non- 
collinear points determine a circle. 

1.4 Incidence geometry in IR n 

It is interesting, and useful for applications to other problems, to consider inci- 
dence geometry in higher-dimensional vector spaces than the plane. This section 
generalises the material of the previous two sections from M 2 to M ra . Since Beck's 
line-counting theorem follows from the Szemeredi- Trotter incidence bound, the 
focus is on higher-dimensional incidence bounds, with their application to line 
counting in ¥L n left as an exercise. 

There are several ways to formulate higher-dimensional results. One is to 
simply consider point-line incidences in the higher-dimensional setting. Another 
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is to consider higher-dimensional objects than lines, for example incidences be- 
tween points and planes in M. 3 . Somewhere between these two is the study of 
'pseudolines' which may be higher dimensional than lines but intersect pairwise 
in at most 0(1) points. 

1.4.1 Points and lines 

The Szemeredi- Trotter theorem extends by a random projection argument to the 
setting where the points and lines lie in R n rather than IR 2 . Simply project P and 
L onto a randomly chosen 2-plane in IR n . With positive probability the number 
of incidences between the projected points and projected lines will be the same 
as those between the originals. 

Since Szemeredi- Trotter is sharp in M 2 it is also sharp in R n . However im- 
provements are available subject to constraints on the arrangements of lines so 
that, for example, not too many of them lie in any one plane. Guth and Katz 
[18] obtained a breakthrough result of this kind in M 3 which enabled them to 
solve the famous 'distinct distances' problem of Erdos. Their incidence result is 
as follows: 

Theorem 1.9 (Guth, Katz). Let L be a set of lines in R 3 of which no more than 
O ([LI 1 / 2 ) are simultaneously incident to the same point, plane or regulu^. Then 



the number of points in M 3 incident to at least k lines in L is at most O 



The bound in Theorem 11.91 should be compared to the Szmeredi- Trotter bound 



k w \L\ l l 2 but Theorem 11.91 is stronger for smaller k. 
1.4.2 Points and planes in M 3 

Some kind of nondegeneracy condition is necessary to say anything interesting 
about incidences between a set P of points and a set II of planes, since if all the 
planes in IT intersect along a single line that is incident to all the points in P, 



2 A regulus is a 'doubly-ruled' surface. That is, every point in a regulus R is incident to at 
least two lines entirely contained in R. 





arising from Corollary 11.41 when k <C 



Lj 1 / 2 . The two bounds agree when 



then I(P,U) = \P\\U 
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Edelsbrunner, Guibas and Sharir [8] proved the following result under the 
fairly strong condition that no three planes are collinear. There are also plenty of 
examples [H EJ [JJ [12] of point-plane incidence results under other nondegeneracy 
conditions. 

Theorem 1.10 (Edelsbrunner, Guibas, Sharir). Let P and IT be a set of points 
and planes respectively in M 3 . If no three planes are collinear then 

/(p,n) < |P| 4/5 |n| 3/5 + \p\ + |n|. 

In the original paper [8] this bound is multiplied by a factor of the form 
|P|°( 1 )|n| o(1 ). However Apfelbaum and Sharir [2] showed that this additional 
factor can be eliminated with more careful analysis and so the refined version is 
used here. 

Like the Szemeredi- Trotter theorem, the Edelsbrunner-Guibas-Sharir theorem 
has a standard corollary, the derivation of which follows precisely as for Corollary 



Corollary 1.11. Let U be a set of planes in M. 3 , no three of which are collinear. 
Then the number of points incident to at least k planes in II is O yjs- + jf^ ■ 

1.4.3 Points and pseudolines 

A set P of points and a collection V of varieties in IR n form a pseudoline system 
if 

1. Any pair of varieties from V intersect in at most 0(1) points in P. 

2. Any pair of points from P are simultaneously incident to at most 0(1) 
varieties in V. 

3. Points in P can be incident to varieties in V only at their smooth points. 

4. If two varieties intersect at a point in P then their tangent spaces intersect 
only at that point. 

Solymosi and Tao [H] proved the following result which generalises the Sze- 
meredi- Trotter theorem, up to a loss of o(l) in the exponent of |P|, to pseudoline 
systems in M. n for which the varieties are of dimension at most n/2. 



33 



Chapter 1. Incidence geometry 



Theorem 1.12 (Solymosi, Tao). Let n >2k and let P and V form a pseudoline 
system in IR n . If the varieties in V are at most k- dimensional, and all have degree 
at most 0(1) then 

I(P, V) < e |p|2/3+e|^|2/3 + |p| + | L |_ 

The case with n = 4 and k = 2 is particularly important in the next section, 
as it corresponds to a Szemeredi- Trotter theorem for complex numbers. In this 
case, Zahl [51] succeeded in eliminating the o(l) loss to give the following result. 

Theorem 1.13 (Zahl). Let P and V form a pseudoline system in 1R 4 . // the 
varieties in V are all 2- dimensional, and all have degree at most 0(1) then 

I(P, V) < |P| 2 / 3 |1/| 2 / 3 + \P\ + \L\. 

1.5 Other fields 

So far we have considered only real vector spaces. But we are also interested 
in other settings, in particular complex numbers and finite fields. Chapter H] 
examines incidence geometry over finite fields and proves several original results, 
so we keep our powder dry and consider only the complex setting here. 

Toth [19] announced 3 ] the following generalisation of the Szemeredi- Trotter 
theorem to C 2 in 2003, but nearly ten years later it is still in the review process. 

Theorem 1.14 (Toth). If P is a set of points in C 2 and L is a set of lines, then 

I(P, L) < |P| 2/3 |L| 2/3 + \P\ + \L\. 
The usual corollary to Szmeredi- Trotter applies here as well: 

Corollary 1.15. Let L be a set of lines in C 2 . Then the number of points incident 
to at least k lines in L is O (^5- + ) ■ Similarly, the number of lines incident 



to at least k points in P is O ^^p- + -^r^ • 



More recently, the work of Solymosi, Tao and Zahl on pseudolines yields 
Toth's result as a special case. Indeed, lines in C 2 are pseudolines in R 4 and so 
Theorem 11.141 is an immediate corollary of Theorem 11.131 



3 The 2011 preprint referenced here is a more recent version of the 2003 original. 
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One can of course then consider complex versions of the higher-dimensional 
generalisations in Section [L4l Theorem 11.141 extends to point-line incidences C n , 
in the same way as Szemeredi- Trotter, although it is not known if the Guth-Katz 
theorem for M. 3 generalises to C 3 . 
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2 



Growth 



Just as the last chapter did for incidence geometry, the purpose here is to in- 
troduce the two examples of growth highlighted in the prologue: sum-product 
estimates and expander functions. The chapter also records particular results 
that will be useful in Chapter |H 

The idea for sum-product estimates is that, for a subset A of a field, at least 
one of the sumset A + A and the product set AA must always be large. Chapter 
|4]will use finite field sum-product estimates as a key ingredient to proving new 
finite field incidence theorems. Chapter will establish a sum-product theorem 
in function fields, a relatively unexplored setting for the problem. 

Expander functions are slightly different from sum-products. Rather than 
considering both the sumset and the product set, the concern is with the cardi- 
nality of a single image set f(A) of some multivariate function /. A medley of 
new results on expander functions are proved in Chapter 

All of the mathematics in this chapter is drawn from the literature. 

2.1 Sum-product estimates 

Let A be a finite subset of a field F, and consider the cardinalities of the sumset 

A + A = {a + b : a, b G A} 

and the product set 

AA = {ab : a,b E A} . 
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It is clear that both these sets have at least the same cardinality as A. And it 
is certainly possible to have + A| pa \A\ or \AA\ pa \A\, for example if A is either 
an arithmetic or geometric progression. But the idea of sum-product estimates 
is that it is not possible to have both of these at once. Results are therefore of the 
form 

max {\ A + A\,\AA\} > \A\ 1+S 

with 5 > 0, holding for any finite set A C F that possibly satisfies some nonde- 
generacy conditions. Larger values of § correspond to stronger results. 

Sum-products were first studied by Erdos and Szemeredi [13] , who proved the 
existence of an unquantified 5 > holding for any finite set A of integers. They 
conjectured that 5 > 1 — o(l). It is not possible to remove the o(l) from this 
conjecture, as demonstrated by the case where A is the set of integers between 1 
and N; in this case \A + A\ pa \A\ but \AA\ pa \ A\ 2 ~° W . 

Sum-product estimates can of course be considered in the more general setting 
of rings instead of fields, and indeed Erdos and Szemeredi's initial work concerned 
the ring of integers. However, when working over general rings the sum-product 
phenomenon is constrained by the possibility of zero divisors. If A contains too 
many then the sumset and the product set might both be small. The motivation 
for sticking with fields in this thesis is therefore that avoiding zero-divisor con- 
siderations means working with an integrel domain, and when given an intergral 
domain one may as well work with its field of fractions. Tao [17] has recently 
obtained a general, non-explicit, sum-product result for rings in the absence of 
too many zero divisors. 

2.1.1 Real and complex sum-products 

The strongest known sum-product estimate is 5 > | — o(l), which was proved for 
any finite set of reals by Solymosi [43] and recently generalised to any finite set 
of complex numbers by Konyagin and Rudnev [30]. There is a fairly extensive 
list [9j dU |3U HU H2] of previous results in the real and complex settings. 

The incidence geometry of Chapter [T] yields a lot of information about sum- 
products via the Szemeredi- Trotter theorem. An example, not quite as strong as 
the more recent Solymosi and Konyagin-Rudnev bounds, is the following result 
of Elekes [9] . A more involved version of this approach can be found in another 
paper of Solymosi [42J. 
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2.1. Sum-product estimates 

Theorem 2.1 (Elekes). Let ACC be finite. Then 

msx.{\A + A\,\AA\} > 

Proof. Define a point set P and a line set L by 

P = (A + A) x (AA) 
L = {l ab : a,b e A} 

where l a b is the line given by y — a(x — b). It is clear that 

|P| = |A + A||A4| 
\L\ = \A\ 2 . 

Each line l a b G L is incident to at least \A\ points in P since for each c E A the 
point (6 + c, ac) is an element of P D Z a {>. Thus 

I(P,L) > \L\\A\ = \A\ 3 . 

Applying the the complex Szemeredi- Trotter theorem (Theorem II. 14p shows that 

I(P, L) < |P| 2/3 |L| 2/3 + \P\ + \L\ 

^\A + A | 2/3 1 | 2/3 1 A | 4/3 + \A + A\\AA\ + \A\ 2 . 

Comparing the upper and lower bounds on I(P, L) yields 

\A\ 3 < \A + A\ 2/3 \AA\ 2/3 \A\ A/Z + \A + A\\AA\. 

So either 

|A| 3 < \A + A\ 2 ^\AA\ 2 / 3 \A\ 4 / 3 

or 

\A\ 3 < \A + A||AA|. 
In the event of the former, rearranging gives 

max{|A + A\, \AA\} > \A\ 5/ \ 

On the other hand, if the latter holds then 

m&x{\A + A\,\AA\} > |A| 3/2 > \A\ 5/ \ 

Either way the proof is complete. □ 
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2.1.2 Finite field sum-products 

Sum-product results for finite fields must be prefaced with a nondegeneracy con- 
dition, to prevent the set A being too close to a copy of a subfield. For example 
if A is a field then max{|A + A\, \AA\} = \A\ and so non-trivial estimates are 
impossible. To keep this issue as straightforward as possible it is often convenient 
to concentrate on the finite field F p of prime order p, where it collapses into the 
requirement for an upper bound on \A\ in terms of p. 

Let's now see what estimates are known in F p . The breakthrough result was 
due to Bourgain, Katz and Tao [B] who proved the existence of an absolute 5 > 
dependent on 7 > that holds whenever \A\ < p 1 ^ 1 . 

Garaev [15] made the Bourgain-Katz-Tao result explicit, obtaining 5 > ~~ 
o(l) whenever \A\ < p 1 ! 2 . This has been subsequently improved by a variety of 
authors [SJ 1221 ED EH EI]- The most recent estimate is 5 > jj — o(l) whenever 
\A\ < p 1 ^ 2 , due to Rudnev |37j. Li and Roche-Newton [33] built on a technique 
of Katz and Shen [28] to extend this estimate to any finite field, not necessarily 
of prime order, so long as A is not too close to being a subfield. 

Chapter E]will use a particular form of Rudnev's i — o(l) result. Its statement 
involves the multiplicative energy E x (A) of A, which is the number of solutions 
to ab = cd with a, b,c,d G A; this is developed more fully in Chapter EJ As 
Rudnev remarks in his paper, the result is really an upper bound on E X (A) in 
terms of the sumset A + A, and works just as well when this is replaced with the 
difference set A — A. In these terms, the estimate can be formulated as follows. 

Theorem 2.2 (Rudnev). Let A C F p . // \A\ < p 1/2 then 



The estimate 5 > i — o(l) follows from the fact (again, see Chapter ED that 
E X (A) is greater than or equal to j^U. 

The proofs of all the above finite field results, including Theorem I2.2[ take 
the same overall approach, sketched below. 

Sketch of finite field sum-product proofs. Given a set A C F p consider the set 



E X {A) 



4 < \A -A\ VI 4 - 
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Note that if f ^ R(A) then 



\A + £A\ = \A{ 



since different pairs of elements from A give different elements of A + £A. Indeed 
if there were a solution to to 

a + £b = c + £d. 

with (a, b) 7^ (c, oQthen rearranging would yield the contradiction 

a — c 



b-d 



e R(A). 



Now, assume that R(A) ^ ¥ p . This is immediate if \A\ < p 1 ^, and with a 
little effort we may assume something very similar so long as \A\ <C p l l 2 . Since 
F p contains no non-trivial additive subgroups, there must exist 

a — b 



such that 



Therefore 



c — d 
a — b 



d 



\A\ 



A + 



e R(A) 
1 i R(A). 



c — d 

= |(c - d)A + (a-b + c- d)A\ 

< \AA - AA + AA- AA + AA - AA\ . 

Thus an iterated sumset of the product set AA must be large. With some work, 
and analysing multiplicative energy rather than the product set directly, it turns 
out that the only way this can happen is if either AA is large itself or A + A is 
large. □ 

As a final remark for this section, it is worth thinking about what a reasonable 
sum-product conjecture might be for finite fields. Since the conjecture in the real 
case is S > 1 — o(l), it might be natural to suppose that this should hold over ¥ p 
whenever \A\ < p 1 ^ 2 . After all, this condition ensures there is no danger of the 
sumset or product set filling up the whole field. However the following result of 
Bourgain |Q, refined by Garaev [15], shows that under this constraint the best 
that can be hoped for is 8 > |. To obtain 5 > 1 — o(l) will require \A\ < p 1 ^ 3 as 
a necessary condition. 
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Lemma 2.3 (Bourgain, Garaev). For any prime p and any integer 1 < N < p 
there is a set A CW P with \A\ ~ iV such that 

max {\A + A\,\AA\}<£ N 1/2 p 1/2 . 

In particular there is a set A C ¥ p with \A\ « p 1 ! 2 such that 

m&x{\A + A\,\AA\} < 

Proo/. Let g G F* be a generator of F*. Let M = p l / 2 N 1 / 2 . By rearranging orders 
of summation it is immediate that 

# ({9 n ■ 1 < n < M} n {y + j : 1 < j < M}) « M 2 . 
Therefore there exists j/GF p such that 

# ({2 n : 1 < n < M} n {y +j : 1 < j < M}) > — = N. 

p 

It is therefore possible to pick a set A contained in the set on the left hand side 
such that \A\ w N. Since A C {g n : 1 < n < M}, 

|AA| < M. 

And since A C + j : 1 < j < M}, 

|A + A| < M. 

So altogether, 

max{|A + A\, \AA\} < M = N 1 ^ 2 
as required. □ 

As explained in the prologue, we are not too worried in this thesis about the 
'large-set' finite field paradigm, where sets are subject to a minimum as well as 
a maximum density requirement. However it is worth noting that under this 
regime, with \A\ > p 2 ^ 3 , Garaev [15] obtained the bound 

max{\A + A\,\AA\} ^> N l/2 p 1 ' 2 

which is optimal in view of Lemma 12.31 
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2.2 Expander functions 

Expander functions are another example of growth, and a natural variation on 
sum-products. An n-variable expander function is a function / : F n — > F for 
which the set 

f(A) = {f(a u ...,a n ) : ai eA} 

has cardinality at least Q [\A\ 1+5 ^ for some 5 > and any A (Z F, again possibly 
satisfying some nondegeneracy conditions. Stronger expander results correspond 
not only to a larger growth exponent 5 as per sum-product estimates, but also 
to a smaller number n of variables. 

Sum-product estimates give a trivial method of constructing four-variable 
expander functions, because it is immediate that 

\{ab + cd : a,b,c,de A}\ > m&x{\AA\, \A + A\} . 

However it is often possible to do better, both in terms of fewer variables and 
bigger exponents. 

In the two-variable case, Elekes, Nathanson and Ruzsa [10] obtained 6 > \ 
for the function 

1 

a + - 

o 

whenever A is a finite set of reals. This was subsequently extended to 5 > ^ by 
Li and Roche-Newton [32] , and was also considered by Bourgain [1] in the prime 
finite field setting for an absolute 5 > 0. The same paper of Bourgain also gives 
the example of the function 

f(a, b) = a 2 + ab 

which has the peculiar property of not even requiring a and b to be drawn from 
the same set in order to establish a growth result. Hart, Li and Shen [19] obtained 
8 > 77^ over finite fields for the function 

— 146 

a + b 2 . 

However, the strongest-known two-variable expander in the real, complex and 
finite field setting is the function 

a + ab. 
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This function was first studied by Garaev and Shen [IB], and a new result is 
established in Chapter El 

In the four-variable case, an expander follows from the recent breakthrough 
of Guth and Katz [18J on the Erdos distinct distance problem. They showed 
that a finite point-set PCM 2 determines at least f2 (|P| 1_o( ^ 1 ' ) ) distinct pairwise 
distances. In the particular case P = A x A this shows that 

(a - b) 2 + (c - d) 2 

is a four- variable expander over R with 5 > 1 — o(l). This methodology was also 
adapted by Iosevich, Roche-Newton and Rudnev [2JJ to show that 

ad — be 

is likewise a four- variable expander with 5 = 1 — o(l). New, stronger, results on 
three and four- variable expanders over R and C are proved in Chapter 
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A handbook on sumset calculus 



This chapter comprises a useful arsenal of technical results for analysing sumsets 
and product sets. Everything is stated in terms of addition and sumsets, but 
extends without complication to multiplication and product sets. 

The basic tools of sumset calculus are the Plunnecke-Ruzsa inequalities, which 
relate the cardinalities of different sumsets. These will be of use in Chapters HI [5] 
and |H1 

Partial sumsets are a generalisation of sumsets, where interest is restricted 
to sums determined by only a subset of possible pairs. Balog-Szemeredi-Gowers 
type results reduce the study of partial sumsets to that of complete sumsets, 
but at the price of reduced efficiency Two such results will come in handy in 
Chapters H] and El 

The additive and multiplicative energy of a set were briefly touched on in 
Chapter but the treatment here is more developed. Energy is useful for two 
reasons. First, sets with small sumset have high energy, which is helpful to know 
when analysing their structure. Second, energy-based arguments go through just 
as easily for partial as complete sumsets, and so can in some cases be more 
efficient than a Balog-Szemeredi-Gowers approach. This is useful in Chapters H] 
and |5j 

Energy considerations also have a particular application in covering methods. 
When considering complete sumsets, these can sometimes more efficient than 
the Plunnecke-Ruzsa inequalities. And since they are energy-based, they apply 
equally well to complete and partial sumsets, and so in some cases are stronger 
than Balog-Szemeredi-Gowers type results. These will be useful in Chapter 
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Except for Lemma 13.141 and Lemma 13.15} which are original variations on 
covering lemmata, all of the mathematics in this chapter is drawn from the liter- 
ature. However it should be remarked that the very useful Lemma 13.41 does not 
seem to be widely known or used in the research community. 

3.1 Introduction 

Chapter [2] defined the sumset A + A and product set AA of a subset A of a 
field. This extends naturally to the sumset A + B and product set AB of two 
different sets A, B, defined as 

A + B = {a + b : a E A,b e B} 
AB = {ab : a G A,b G B} 

Define also the difference set 

A-B = A + (-B) 

and ratio set 

A/B = A(B\{0})- 1 . 

The essential problem of sumset calculus is that we are given a pair of sets 
A, B and another related pair C, D. Given information about the sum, difference, 
product or ratio set of A and B, what can be said about that of C and D7 

The results in this chapter are general, in the sense that they depend only 
on the fact that sets are finite and contained in an abelian group. So they are 
quoted and proved for sumsets A + B, taking + to be an abelian group operation 
rather than necessarily the additive group of a field. They therefore extend to 
multiplication and product sets without complication. In some instances it will 
be convenient to adopt slightly different notation in the multiplicative case to 
avoid confusion in later chapters; this will be indicated where appropriate. 

3.2 The Pliinnecke-Ruzsa inequalities 

Information on sumsets is often expressed in terms of cardinalities. There are 
trivial estimates 

max(\A\,\B\) < \A + B\ < \A\\B\. 
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Informally, a sumset set is 'small' if its cardinality is close to the lower bound 
max (\A\, \B\) and 'large' if it is close to the upper bound The 'efficiency' 

of an estimate corresponds to the number of factors of the form \A + B\ which 
turn up; the fewer the better since this allows the proof of stronger theorems. 

The most basic tool available for analysing sumset cardinalities is the Ruzsa 
triangle inequality, which applies to difference sets but not immediately to sum- 
sets. 

Lemma 3.1 (Ruzsa triangle inequality). For sets A,B,C we have 

\ A _ B ^\A-C\\B-C\_ 

l 

Proof. For each element x G A — B, pick a x £ A and b x G B such that 

X * 

Consider the map / from (A — B) x C to (A — C) x (B — C) given by 

f(x,c) = (a x - c,b x - c). 

We show that / is an injection. Suppose that f(x,c) = f(y,d). Then 

a x — c = a y — d (3-1) 
b x — c = by — d (3.2) 

Subtracting (13.21) from (13.11) gives a x — b x = a y — b y and so x = y, which in turn 
forces a x = a y and b x = b y . Plugging this back into (13. ip shows that c = d, and 
so (x, c) = (y, d). Thus / is an injection and 

\A-B\\C\ < \A-C\\B-C\. 

Rearranging gives the required inequality. □ 

Note that the essential observation used in the above proof is that 

a — b = (a — c) — (b — c) 

for any a, b, c. Variations on this approach are used several more times in the 
thesis, and are referred to as 'Ruzsa-type'. 
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The Ruzsa triangle inequality is a simple and useful tool, but it applies only 
to difference sets and it is often necessary to care about general sumsets. This 
can be mitigated by clever applications, for example in the case A = B taking 
C = —A gives 

\A-(-A)\\(-A)-A\ \A + A\ 2 



A- A < 



\A\ " \A\ 



but this is not always terribly efficient. Moreover the Ruzsa triangle inequality 
does not help very much when analysing iterated sumsets of the form 




Such situations call for Plunnecke's theorem, which was promulgated in this 
context by Ruzsa [2B] • A much shorter proof was recently given by Petridis [36J , 
for which we follow an exposition given by Gowers [T7] . 

Lemma 3.2 (Pliinnecke). For sets A,B there exists A' C A such that for any 
natural number k, 

\A' + kB\ < 1 1 ' 1 . (3.3) 



In particular, 

IfcRl < t 

\A\ k 



A + B\ k 



kB\ < n~7TfcZi ■ (3-4) 



Proof. Pick A' C A such that 

\A' + B\ 



K 



\M\ 

is minimal across all subsets of A. It suffices to prove that 

\A' + kB\ < \A'\K k (3.5) 

since by minimality in the choice of K, 

K< \A + B\ 



\A\ 

To establish ( 13. 5 p it suffices in turn to show that 

\A' + B + C\ < K\A' + C\ (3.6) 
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for any set C. Indeed, once ( 13. 6 p is established, inequality (13. 5p follows by induc- 
tion on k, since taking C = (k — 1)B gives 

\A' + kB\ < K\A' + (k - 1)B\ 

and by inductive hypothesis 

\A' + (k - 1)B\ < K k ~ x \A!\. 

Now to prove (13.61) . Induction is used here as well, this time on \C\. When 
\C\ = 1 it is immediate that \D + C\ = \D\ for any set D. In particular, 

\A' + B + C\ = \A' + B\ 

and 

K\A' + C\ = K\A'\ 

so (13.61) holds with equality by definition of K. Now suppose that \C\ > 2 and 
write 

c = a u {x} 

so that by inductive hypothesis, 

\A' + B + C'\ < K\A' + C'\. 

By inclusion-exclusion, 

\A' + B + C\ = \A' + B + (C U {x})\ 

= \A' + B + C'\ + | (A' + B + {x}) \(A' + B + C')\ 
= \A' + B + C'\ + \A' + B + {x}\ 

- \{A' + B + {x}) n {A' + B + C')\ 
<\A' + B + C'\ + \A' + B\-\ (A' + {x}) n (A' + C) + B\ 
= \A' + B + C"| + | A' + S| - | A' n (A' + C - {x}) + S| 
< K (\A' + C"| + \A'\ - \A' n (A' + C- {x})\) (3.7) 

On the other hand, 

\A' + C\ = \A' + C'\ + \(A' + {x})\{A' + C')\ 

= \A' + C'\ + \A' + {x}\ -\{A' + {x}) n (A' + C")| 

= | A' + C"| + \A'\ - \A' n (A' + C- {x})\ (3.8) 
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and so subsituting (13. Sp into (13. 7j) gives 

\A' + B + C\ < K\A' + C\ 
as required. □ 

Lemma [3.21 is typically applied in the form (13. 4p since this omits mention of 
the subset A' C A. However Katz and Shen [29] observed that with a little work 
it is possible to show that \A'\ ~ \A\, which can in some circumstances make 
(13. 3 p a better bet. 

Corollary 3.3 (Katz, Shen). For sets A,B, there exists A' C A with \A'\ > ^ 
such that 

\A' + kB\ < fe 



\A + B\ k 



\A\k- 1 ' 

In situations where nothing is lost by passing to a constant-proportion subset 
of A, Corollary 13.31 effectively implies that 

\A + A\ k - 1 

\ kA \ «* \ A \ k -2 » 

which is better than Lemma [3.21 by a factor of ^rjp- 

Proof of Corollary \y. HA Let A* be any subset of A with |A*| > By Lemma 
13.21 there exists A'^ C A* such that 





+ B\ 


k 




1 A* 


k 



\A' + kB\ < 1 *' *„ 1 < 



|A;i|A + g| fc 

\A\ k 

Apply the above observation recursively. Begin by taking A* = A to find 
Ax C A such that 

Ui + kB\ < fc 







k 




\A\ 


k 



Now take A* = A \ Ax to find A 2 ^ A disjoint from A\ such that 

U 2 + kB\ < fc 



|A 2 | |A + £?| fc 



Repeat this process until reaching n such that IJILi ^ * s °f cardinality at least 
M Let 

1=1 
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Since all of the are disjoint it follows that \A'\ > ^ and 

n 

\A' + kB\ < J2\ A i + kB \ 



i=l 



\A + B\ k J^. A 



\A\ k 
\A + B\ 



i=l 



\A\ k ^ 

as required. □ 

3.3 Partial sumsets 

For sets A, B and GCAxB, call the set 

A + B = {a + b: (a,b) eG} 

a partial sumset. In the case where G = A x B this collapses into the sumset 
A + B, referred to as the complete sumset where there is a possibility for 
confusion. The motivation for using the letter G is that GCixB corresponds 
to (the edges of) a bipartite graph connecting A and B. 

Analysing partial sumsets is critical to the research in Chapters S] and 
but the Pliinnecke-Ruzsa inequalities developed above apply only to complete 
sumsets and so are not immediately useful. Fortunately, it turns out that if G is 

G 

a reasonably large part of Ax B and the partial sumset A + B is small, then there 
are large subsets of A and B whose complete sumset is small as well. Results of 
this kind are called Balog-Szemeredi-Gowers type or BSG-type after the 
standard theorem in this area, due to Balog and Szemeredi and strengthened by 
Gower£. 

Despite their usefulness, Balog-Szemeredi-Gowers type results are dispropor- 
tionately costly. This is in the sense that they yield upper bounds on complete 
sumsets that are out of proportion to the density of G in A x B, and in partic- 



ular they pick up more than a constant factor when \G\ ~ Sections 13.41 



1 See Theorem 2.29 of [48] for a formulation and proof of the Balog-Szemeredi-Gowers theo- 
rem. Note however that the factor K 4 in equation (2.20) of that formulation should be replaced 
with K 5 due to an error in the text. 
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and 13.51 describe methods which do not see this distinction and so can in some 
circumstances be more efficient. 

This thesis uses two BSG-type results. The first result is more efficient but 
is applicable only when G is especially dense in A x B. It is a consequence of 
Exercise 2.5.4 in jig] . 

Lemma 3.4 (BSG-type for dense sets). Let < e < 1/4 and let G C Ax B with 
\G\ > (1 - e)NI-B|- Then there exists A' C A with \A'\ > (1 - v^l^l such that 



A'-A'l < e 



\A 


- B\ 


2 




A\ 





Lemma [3.41 does not seem to be widely used in the literature. But, where it 
is applicable, it is remarkably efficient for a BSG-type result. 

The second BSG-type result, due to Bourgain and Garaev [5], is less efficient 
but is on the other hand applicable in a wider variety of situations. 

Lemma 3.5 (BSG-type for less-dense sets ). For sets A, B and G C A x B there 
exists A' C A with \A'\ 3> ]§[ such that 



There is considerable overlap between the proofs of Lemma 13.41 and Lemma 
13.51 For both it is convenient to adopt the following definitions. 

For a G A, write Bc{a) for the set of b G B such that (a, b) G G. Call the 
cardinality of Bg{o) the G-degree of a, and the cardinality of the intersection 
Bg{o,i) H Bg{o:2) the joint G-degree of a\ and a2- 

For both results it suffices to find a large subset of A with the property that 
the joint G-degree of any two distinct elements is large. This is demonstrated by 
the following Ruzsa-type lemma. 

Lemma 3.6. Let G C A x B . Suppose that A' C A and H C A' x A' are such 
that every the joint G- degree of any (oj, 02) G H is at least K . Then 



A' - A'\ < 



\A\ 



4 \B\ 3 \A-B\ 4 
\GT 5 





K 
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In particular, if the joint G-degree of any pair of elements from A' is at least K 
then 



A' - A \ < 



G 

\A-B 



2 



Proof. For each x G A' — A' pick a\,a\ G A' such that 

Let Y C (A' - A') x B be given by 

Y = {(x,b):beB G (al)nB G (a 2 x )}. 

By hypothesis, 

\Y\ > \A'-A'\K. 
On the other hand, the injection 

(x, b) i — y (a x — b,a 2 x — b) 

G G 

from Y into (A — B) x (A — B) shows that 

\Y\ < \A-B\ 2 . 

Comparing the upper and lower bounds on \Y\ gives the result. □ 

The ease of finding sets satisfying the conditions of Lemma 13.61 depends on 
the density of G, corresponding to the two different BSG-type results. Over the 
next few pages, Section 13.3.11 builds the proof of Lemma 13.41 and Section 13.3.21 
builds the proof of Lemma | 



3.3.1 High-density partial sumsets 

This section proves Lemma 13.41 the BSG-type result for high-density partial 
sumsets. In view of Lemma 13.61 this is accomplished by the following. 

Lemma 3.7. Let e > and G C A x B with \G\ > (1 — e)|A||S|. There exists 
A' C A with \ A'\ > (1 — -y/e)|A| such that the G-degree of every a G A' is at least 

(i-V3|£|. 

In particular, any two elements of A' have joint G-degree at least (l — 2y/e)\B\ 
whenever e < 1/4. 
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Applying Lemma 13.61 to the conclusion of Lemma 13.71 with H = A' x A' and 
K = (1 — 2y/e)\B\ immediately yields the statement of Lemma [3.41 



Proof of Lemma 3. 7 Let A' C A be the set of a G A with G-degree at least 
(1 — yfe)\B\. It suffices to show that \A'\ > (1 — ^/e)|A|. To this end, observe 
that 

(l-e)LA||£|<|G| 

= Ei^wi 

<Ei fi i+ E (i-vi)iBi 

aeA' aeA\A' 

= |A'||B| + (|A|-|A'|)(l-v^)|B|. 

It follows that 

(l-e)|A|<|A'| + (l^|-|^l)(l-Vi) 
and so > (1 — y/e) as required. □ 

3.3.2 Less-dense partial sumsets 



This section proves Lemma [3.5[ the BSG-type result for less-dense partial sum- 



sets. The proof is not quite as direct as that for Lemma 13.41 and the main 
additional ingredient is the following lemma. 



Lemma 3.8. Let G C A x B . Then for any e > there exists A' C A with 
M 



> pi, and H C A' x A' with \H\ > (1 - e)\A'\ 2 , such that the joint G -degree 
of any (a 1 ,a 2 ) G H is at least 



e\G\ 



2\A\'\B\- 



Proof. Let H be the set of (ai, 02) E A x A with joint G-degree at least 2 |1p[,b[ • 
To prove the lemma it suffices to find A' C A with \A'\ 3> |^| such that 

\(A' x A') n if| > (1 -e)\A'\ 2 . 

It is clear that 



E |S G (ai) nfi G (a 2 )| < 

(oi,a 2 )^H 

and hence 



G 



2 



2 LB 



£ IWn W |lMM < | (3. 9) 
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Also, Y^b&B \Bg(P)\ = |G| and so by Cauchy-Schwarz 

\B G (a 1 )nB G (a 2 )\> 1 -^. (3.10) 



01,026^ 

Comparing (I3.9P and (13.101) shows that 

I 
\B\ 



£ IWnWl(i- """f g> )>g 



ai,a2&A 

Rearranging gives 

E E (> 

beB ai ,a 2 &A G (b) V 

So there exists beB such that 

ai,a 2 eA G (fe) 



l((ai,a 2 ) > |G| 



2 LB 



l((ai,a 2 ) > |G| 



2 LB 



Evaluating the left hand side shows 

\(A G (b)xA G (b))\H\ ^ \G\ 



Aa{b)\ A - ' > 



2\B\ 2 

Since the left hand side is at most |A G (6)| 2 it follows that |A G (6)| ^> 1^1. Take 



A' = A G (b) so that 



\(A'xA')\H\<e[\Af-^- 2 ) <e\Af. 



This implies \(A' x A') n H\ > (1 - e)\A'\ 2 as required. □ 



With this established, it is now possible to prove Lemma 13.51 

Proof of Lemma Iff. 51 Let e > be sufficiently small, and fixed. By Lemma 13.81 
there exists A' C A and H C A' x A' with \A'\ > jfj and \H\ > (1 - e)|A'| 2 , 

such that any (ai,a 2 ) G have joint G-degree at least Q ( r^^ j- Lemma [223 
implies 

, A , H A n \A-B\ 2 \A\ 2 \B\ 
\A' - A' < — 



|G| 
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By Lemma 13.71 there is a subset A" C A' with \A"\ 3> \A'\ such that any two 
a\,a2 G A" have joint if-degree at least (1 — 2-y/e)|A'|. A final application of 
Lemma [3.61 shows that 



H 



\A' - A 



'12 



\A\ 4 \B 


131 


A-5| 4 




G 


5 



| A" - A"\ < 1 , - ' < 
1^1 

as required. □ 



3.4 Additive energy 

Define the additive energy E + (A,B) of set A and to be the number of 
solutions to the equation 

a + b = a' + b' (3.11) 
with a, a' G A and b, b' G -B. Note that 

£ + (A£) = £+(A-#) 

since ( 13.1 ip holds if and only if a — b' = a' — b. Define the additive energy of a 
single set A to be 

E + (A) = E + (A,A). 

When working with the multiplicative group of a field, we refer instead to 
multiplicative energy, denoted by E X (A,B), i.e. the number of solutions to 
ab = a'b' with a, a' G A and b, b' G B. 

Energy and sumsets are closely related, since the only way in which |y4+S| can 
be small is if there are many pairs of elements giving the same sums. This would 
necessitate many solutions to (13.111) and thus a large energy. This intuition will 
shortly be made rigorous, but in order to do so it is first necessary to generalise 
and slightly reformulate energy. 

Just like generalising sumsets to partial sumsets, given G C A x B it is 
natural to consider the number of solutions to (13. lip with (a, 6), (a', b') G G. 
This restriction is called the additive energy of G and denoted by E + (G). As 
with partial sumsets, this collapses to the definition of E + (A, B) when G = AxB. 

The following lemma shows that additive energy can be helpfully formulated 
as an L 2 norm. 
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Lemma 3.9 (Reformulation of additive energy). For G C A x B, let Hg{ x ) 
denote the number of (a, b) G G with a + b = x. Then 

E + (G)= ^o(x) 2 (3.12) 

G 

x&A+B 

= »a(a + b). (3.13) 

(a,b)eG 

Proof. It is clear that 

E + {G)= #{{a,b),{a',b') e G : a + b = a' + b' = x} 

G 

x&A+B 

= #{M) eG:a + b = x} 2 

G 

x&A+B 
x&A+B 

which establishes (13.121) . Similarly, 

E + (G)= #{(a',b'):a + b = a' + b'} 

(a,b)eG 
(o,6)eG 

which gives f)3.13p . □ 
When G = A x B we have Hg{ x ) — \An(x — B)\ and the following corollary. 
Corollary 3.10. For sets A,B we have 

E + (A,B)= £ |An(x-5)| 2 (3.14) 

= E |^n(a + 6-5)| (3.15) 
= E |(5 + o)n(S + o')| (3.16) 

a,a'eA 

Proof. Equations f)3. 14j) and ( I3.15P follow immediately from Lemma 13.91 and the 
fact that hg{%) = \A D (x — B)\ when G = A x B. Equation (I3.16P follows by 
rearranging orders of summation from (13. 15ft . 

□ 
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Now let's make rigorous the earlier discussion of the relationship between 
energy and sumsets. The following estimates are trivial. 

\A\\B\ < E+(A,B) < \A\ 2 \B\. 

Energy close to is 'low' and that close to |y4| 2 |S| is 'high'. The following 

result shows that, as expected, small sumset implies high energy. This can be 
useful to know when analysing the structure of sets with small sumset. 

Lemma 3.11. If G C A x B then 

E+(A,B) > E + (G) > — ^p- — 

\A + B\ 

Note that in this lemma, partial and complete sumsets are on the same footing: 
unlike BSG-type results there is no additional premium to pay for working with 
partial sumsets. 

Proof of Lemma \3.11\ It is clear that 

£ ^g(x) = \G\. 

G 

xeA+B 

So by Cauchy-Schwarz, 

\G\ 2 < \A + B\ £ /i G (x) 2 . 

G 

x&A+B 

By Lemma [XH the right-hand side is equal to \A+ B\E + {G) and so 

E+{A,B) > E + (G) > —^7— 

\A + B\ 

which completes the proof. □ 

The converse to Lemma[3JJ]is false: high energy does not automatically imply 
small sumset. For example if A is the union of an arithmetic progression A\ and 
a geometric progression A 2 , each of cardinality |A|/2, then A is high-energy since 

E+(A) » E + (A{) w \A\ 3 . 
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However the sumset \A + A\ is also large since 

\A + A\> \A 2 + A 2 \ « \A\ 2 . 

What does hold, however, is the following partial converse. If a set has high 
energy then there must exist a fairly large G C A x A for which the partial sumset 

G 

A + A is small. This fact will not be required in the thesis and so precise details 
are omitted. 



3.5 Covering methods 

A covering lemma is a result of the following form. If A and B have small 
sumset then a large part of A can be 'covered by' (contained in the union of) 
a small number of translates of B or some modification of B. The canonical 
example is due to Ruzsa, for which a proof can be found in [48J: 

Lemma 3.12 (Ruzsa). For any sets A,B, the set A is contained in the union of 
translates of B — B 

Another more recent example is due to Shen [40] . In a similar spirit to the im- 
provement of Lemma I3T31 over Lemma [3T2l this enables a covering with translates 
of B instead of B — B, at the expense of leaving a small part of A uncovered. 

Lemma 3.13 (Shen). For sets A,B and e > there exists A C A with \A'\ > 
(1 — e)\A\ such that A is contained in the union of O e ( ^jgj 5 ^ translates of B. 

The proof of Shen's covering result is based on additive energy, and so with 
some tweaks applies to the situation where only a dense partial sumset, rather 
than a complete sumset, is small. In certain situations, such as in Chapter [51 
this can be a more-efficient substitute for Balog-Szemeredi-Gowers type methods, 
since there is no additional cost for dealing with partial rather than complete 
sumsets. 

Two original variations on this theme are used in Chapter The first, below, 

G 

essentially restates Shen's result with A — B replaced by A — B. 

Lemma 3.14 (Shen variation 1). Let G C A x B and < e < 1/4. If \G\ > 
(1 - e)|A||B| then there exists A C A with \ A\ > (1 - 2^)1^1 such that A is 

contained in the union ofO e I ^jjsr^ ) translates of B. 



59 



Chapter 3. A handbook on sumset calculus 



Similarly, there is a subset A" C A with \ A"\ > (1 — |v4_| such that A" in 
contained in the union of O e ^^jgp^ translates of —B. 

Proof. We shall prove the case for covering with translates of B, and remark on 
the slight alteration needed to cover with translates of — B. 

Since \G\ > (1 — e)|A||S| there is by Lemma 13.71 a subset A% C A with 
\Ai 1 > (I - suc h that every element of Ai has G-degree at least (1 — y/e)\B\. 

Now for any subset A* C A\ let = G fl (A* x 5) so that 

1^1 > [l-y/e)\A\\B\. 
By Lemma [3.111 it follows that 

£+(A , B) >^>^(W£)W. 

By Corollary EIOl 

= E+(A„-B) = \An(a-b) + B\ (3.17) 

a£A t ,beB 

and so for any A* C Ai there exist a G A», 5 G B such that 

|A*|(1-V^) 2 |S| 



|A*n(a-6 + S)| > 



G 

A-B 



Apply the above discussion to a sequence of subsets of A\. Begin by taking 
A* = A\ to find ax G Ax, b\ e B such that 



\A X n (ai - 6i + > 



|A|(l-y^) 2 |g| 

G 



The translate (a\ — bi) + B covers I^K 1 j 7 ^ 1 l B l elements of Ax. Discard A\ fl 

|A-B| 

(ai — bi) + B from At and let A2 be the set of elements remaining, now taking 
A* = A 2 and repeating the process. 

Iterate O e f ^^pM times until the set remaining is of cardinality no more than 

i/e|Ai|. Then take A' to be the set of elements discarded across all iterations, so 
that \A'\ > (1 - y/e)\Ax\. Since \A t \ > (1 - >/e)|A| we get \A'\ > (1 - 2^/e)|^| as 
required, which completes the proof for covering with translates of B. 
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The proof for covering with translates of —B is identical, except that in place 
of (1337)1 . the identity 



E+(A,,B) = \An(a + b-B)\ 



a£A*,beB 

is used instead. □ 

The second original variation on Shen's result has a slightly different formu- 
lation. Instead of covering a large part of A with translates of B, it yields a large 
part of G whose corresponding partial difference set is covered by few translates 
of B. 

Lemma 3.15 (Shen variation 2). Let G C A x B and < e < 1. Then there 

G> 

exists G C G with \G \ > (1 — e)\G\ such that A — B is contained in the union 
of O e ( ^ A ~\g\ A ^ t rans l a t es of B. 

Proof. Let G* be any subset of G. By Lemma 13.111 there are at least ^ 

\A-B\ 

solutions to 

a — b = a — V 

with (a,b), (a',b') G G*. So there exists a* E A for which there are at least 
— pairs (a, b) G G> with 

a — b G a* — £>. 

\G I 2 

In other words, the translate a* — B accounts for the differences of G * — pairs 

|A-B||A| 

from G*. 

Apply the above discussion to a sequence of subsets of G. Begin by taking 
G> = G to find a translate of B accounting for the differences of ^ pairs 

\A-B\\A\ 

from G. Discard these pairs from G\ and let G2 be the set of pairs remaining, 
now taking G* = G 2 and repeating the process. 

Iterate O t ( ^ A ~\cf A ^ times until the subset of G remaining is of cardinality 

no more than e\G\. Then take G' to be the set of discarded pairs, so that \G'\ > 

Ql / G \ 

(1 — e)|G|, and A — B is contained in no more than € I ^r^p^ ) translates of 



|G| 

B, as required. □ 
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Incidence theorems over finite fields 



Chapter [T] introduced incidence geometry but deferred discussion of the finite 
field case. This chapter now picks up that baton. 

The state of the art for incidences is weaker over finite fields than R and C, 
and the discrepancy is much greater than the analogous gap for growth results. 
This chapter narrows that divide with two new theorems: an incidence bound 
and a line-counting result. 

An earlier version of this work has been submitted to the European Journal 
of Combinatorics. A preprint [23], and a more recent update [23] also used here, 
are available on the arXiv. 

4.1 Results 

This section describes the two results proved in this chapter. 
4.1.1 Incidence bounds 

Chapter [T] showed that in the 'critical' case \P\, \L\ = N, non-trivial incidence 
bounds are of the form 

I(P, L) < iV 3/2 - £ 

for e > 0. The Szemeredi- Trotter theorem (Theorem 11.31) gives the sharp result 
e = | in M 2 and generalises to the complex setting C 2 (Theorem II .14[) . 

Chapter [2] noted that any nontrivial results on growth in finite fields must 
be predicated on keeping away from subfields. The same is true for incidence 
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bounds, but this time one must keep away from subplanes. Just like growth, to 
keep these considerations as straightforward as possible it is often convenient to 
work with the field F p of prime order p. 

To see why avoiding subplanes is necessary, consider the example where P 
is the whole plane F 2 and L is the set of all lines in F 2 . Taking N = p 2 gives 
\P\, \L\ « N, but since every line in L is incident to p points in P it follows that 
I(P, L) « iV 3 / 2 and so in this case a non-trivial estimate is impossible. 

Working over F 2 , Bourgain, Katz and Tao [6] proved the existence of a non- 
trivial e > 0, dependent on 7 > 0, whenever iV < p 2 ~ J . This has been made 
explicit in two cases: 

• In the 'small-set' regime N < p, Helfgott and Rudnev [20] obtained a bound 

• In the 'large-set' regime p 1+1 < N < p 2 ~ 7 , Vinh |50j obtained e > j. 

For the purposes of this thesis, where as explained in the prologue the emphasis 
is on small rather than large sets, the Helfgott-Rudnev bound is the one to beat. 

It is instructive to compare with the situation for growth and sum-products. 
With sum-products there is a growth exponent of 5 > | — o(l) over R and C, 
and 5 > tt — o(l) over finite fields, which is a bit weaker but not excessively so. 
By contrast the finite field incidence bound of e > 1Q g 78 is an awfully long way 
from the real and complex bound of e > | implied by Szemeredi- Trotter. 

The first new result in this chapter goes some way to redressing this disparity 
by obtaining a much stronger finite field incidence bound of e > — o(l). 

Theorem 4.1. Let N < p. If P and L are a set of points and lines over¥ p with 
\P\, \L\ < N then 

I{P,L) < ivi-sfe. 

4.1.2 Line counting 

Beck's theorem from Chapter [TJ shows that any set of points in M 2 satisfies at 
least one of two extremes. Either there are at least f2(|P|) collinear points, or 
the set L(P) of lines determined by pairs of points in P is of cardinality at least 

n(\p\ 2 ). 
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As with incidence bounds, a nondegeneracy condition is required for Beck- 
type theorems over finite fields. For example if P = W 2 then no more than |P| X / 2 
points are collinear, and so the first possible conclusion of Beck's theorem cannot 
hold. But the second cannot hold either since |L(P)| ~ |P|. 

As with incidence bounds, there have been explicit finite field versions of 
Beck's theorem in two instances: 

• In the particular case of the 'small-set' regime \P\ < p for which P = A x A 
with A C F p , Helfgott and Rudnev J2U] showed that \L(P)\ > \P\ 1+ ^f. 

• In the 'large-set' regime, Iosevich, Rudnev and Zhai [22] recently showed 
that |L(P)| ~ p 2 whenever \P\ > plogp. 

The 'small-set' Helfgott-Rudnev result is again, for the purposes of this thesis, 
the one to beat. The second new result in this chapter does so in two respects. 
First, there is a stronger exponent of ^ — o(l) in place of Second, the result 
holds for general P C F 2 with \P\ < p rather than simply those of the form 
P = A x A. 

Theorem 4.2. If P C W 2 and \P\ < p then at least one of the following must 
occur: 

1. At least Q (\P\) points from P are contained in a single line. 

2. |L(P)|>|P| 1+ ife. 

4.1.3 Structure 

The body of this chapter is concerned with the proofs of Theorems 14.11 and 
14.21 Both follow from new observations that efficiently relate incidences to sum- 
product estimates. 

Unlike the incidence material in Chapter HJ the analysis here makes crucial use 
of the properties of the projective plane. Section 14.21 uses this to show that the 
existence of a certain configuration of points and lines would imply the existence 

G 

of sets A, B and fairly dense G C A x B such that the difference set A — B and 

G 

the ratio set A/B are both small. Such a configuration is called a 'sum-product 
configuration'. 
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Section 14.31 goes on to show that if there are too many points incident to 
too many lines, which would arise if either Theorem 14.11 or Theorem 14.21 were to 
fail, then there must also exist a large sum-product configuration. 

The situation is then almost ripe for applying a finite field sum-product theo- 
rem from Chapter [2] to show that a sum-product configuration cannot arise, and 
so there cannot be too many incidences. The final hurdle is to relate the par- 
tial difference and ratio sets arising from a sum-product configuration to complete 
sets. This is accomplished in Section 14. 4l using a concoction of Balog-Szemeredi- 
Gowers type results from Chapter El 

Section 14.51 then uses the analysis from the previous sections to prove The- 
orems 14.11 and 14.21 

Section 14.61 gives pointers for further work. 

4.2 Foci and configurations 

This section uses the theory of projective transformations to interpret particular 
types of point-line configurations in terms of sum-product (or strictly speaking, 
difference-ratio) estimates. The key observation is that difference and ratio sets 
can be interpreted in terms of gradients, and that a projective transformation 
enables these to be viewed as point-line incidences. 

There are several pictures of points and lines in this section, apparently treat- 
ing them as if in ¥L 2 . These are for illustration only; they do not correspond 
rigorously to the plane W 2 , although the definitions they illustrate do. 

Let F be any field. Recall, or quickly check Appendix (Bj that the projective 
plane FF 2 is given by equivalence classes of F 3 \{(0, 0, 0)} modulo dilation. More 
concretely, it can be viewed as the union of F 2 with a 'line at infinity' l^. Points 
of Zoo correspond to gradients in FF 1 = F U {oo}, so that for each A G FF 1 there 
is a point p\ G l^ that is incident to all lines of gradient A. 

Recall also some theory on projective transformations of FF 2 . These are 
line-preserving permutations of FF 2 . A frame in FF 2 is a set of four distinct 
points, no three of which are collinear. For any two frames there is a unique 
projective transformation that maps one to the other. 

Now for some new definitions. If P is a set of points in FF 2 and p ^ P is 
an individual point then say that p is a i^-focus for P if P is supported over at 
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most K lines through p. This is illustrated in Figure 14.11 where the red point is 
a 3-focus for the blue points. Note that if p is a i^-focus for P then it is also a 
.fT-focus for any subset of P. 




Figure 4.1. A point set and focus 



We also give a highly suggestive name to a particular configuration of points 
and foci. Let P be a point set and Pi,P2,P3,P4 £ P be distinct points. Say that 
P and the Pi form a .ff-sum-product configuration if 

1. Each pi is a .fT-focus for P. 

2. There is a line, which we call the base line, incident to P2,P3 and p4 but 
not p\. 

3. No point in P is incident to the base line. 

Figure I4T21 illustrates sum-product configurations with (a) the base line in general 
position and (b) the base line at infinity. The red, black, green and yellow points 
are the foci pi and the blue points are the elements of P. 

The following lemma justifies the choice of definition by showing that a sum- 
product configuration does indeed correspond to a (partial) sumset and product 
set. 
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(a) Base line in general position 




(b) Base line at infinity 
Figure 4.2. Sum-product configurations 



Lemma 4.3. Let F be a field. Suppose P is a set of points in F 2 , and that it 
forms a K -sum-product configuration with points Pi,P2,P3,P4- Then there exist 
sets A,B C F and G C Ax B with 

\G\ = \P\ 

and 

g g 
\A\,\B\,\A-B\,\A/B\ < K. 
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Proof. There is a projective transformation mapping any frame (four points, no 
three of which are collinear) to any other frame, and so certainly there is one map- 
ping any non-collinear triple to any other. Since Pi,P3 and p^ are not collinear, 
and nor are [1, 0, 0], [0, 1, 0], [0, 0, 1], it is possible to pick a projective transforma- 
tion r such that 

• pi is sent to the origin [0, 0, 1] of F 2 . 

• ps is sent to [0, 1, 0] G loo- 

• p 4 is sent to [1, 0, 0] G loo- 

Since r preserves linearity and sends p^ and p±, which are incident to the base 
line, to points on loo, it in fact sends the whole base line to Zoo- 

Let G = t(P). This is contained in F 2 since none of the points in P are 
incident to the base line and so none are mapped into Zoo. 

The set G is supported over K 'vertical' line£ and K 'horizontal' lineal, since 
lines incident to [0, 1,0] G loo are all vertical and those incident to [1,0,0] G loo 
are all horizontal. Let A be the set of x-intercepts of the vertical lines and B be 
the set of y-intercepts of the horizontal lines, so that 

G C A x B 

and 

LAI, 151 < K. 



These correspond to the yellow and black lines respectively in Figure I4.2L 

Furthermore, G C A x B is supported over K lines through the origin. These 
are identified by their gradient, and a point (a, b) G G is incident to the line with 

G 

gradient £ if and only if | = £. Thus each element of A/B corresponds to a 



different line through the origin, i.e. a red line in Figure I4.2[ and so 

G 

\A/B\ < K. 

Finally, G is supported over K lines through r(p2). Since r preserves linearity 
and sends the base line, which is incident top2 ; to loo we know that t(j> 2 ) G loo- So 



1 that is, lines of the form x — c for some c E F 
2 that is, lines of the form y — c for some c £ F 
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all lines incident to r{jp2) have the same gradient, say A G F. These are identified 
by the intercept with the ?/-axis, and a point (a, b) G G is incident to the line with 

G 

intercept p if and only if a + Xb = p. Thus each element of A + XB corresponds 
to a different line of gradient A, i.e. a green line in Figure 14.21 Therefore 

\A + XB\ < K. 

Now let B' = -XB and G' = {{a, Xb) : (a, b) G G} to obtain 

\G'\ = \P\ 

and 

G' G> 

\A\,\B'\,\A-B'\,\A/B'\ < K. 

□ 



4.3 How to find foci 

Section I4T21 shows that a sum-product configuration of points and foci corresponds 
to an upper bound on partial sum-products. This section shows that such a 
configuration must arise whenever there are many point-line incidences. The 
main result is the following lemma. 

Lemma 4.4 (Finding a sum-product configuration). Let P and L be a set of 

points and lines respectively in a plane such that every point in P is incident to 
Q(K) lines in L. Suppose that K ^> ^tji , and that each line in L is incident to 
at most 

points in P. 

Then there exists a subset of P of cardinality f ^ji and points pi, P2, P3, p& 
with which it forms a O(K)- sum-product configuration. 

It will be helpful when proving Lemma |4"~41 to adopt some additional notation. 
For points p, q in the plane, let l pq be the line determined by p and q. Given a 
set P of points and a set L of lines, both satisfying the conditions of Lemma I4.4[ 
and a single point p G P, define 

P pL = {qeP: l pq G L} 
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so that P p L is the set of points in P incident to lines in L going through p. Thus 
P p l has p as an 0(K)-focus, since every point in P is incident to Q(K) lines in 
L. 

As a first step, it is convenient to record the following standard result. 

Lemma 4.5 (Rich points and lines). Let Pi be the set of points in P incident to 
at least 2\p\ ^ nes ^ n L- Then 

I(Pi,L)^I(P,L). 

Similarly, if L\ is the set of lines in L incident to at least I 2 P \l\ points P then 

I(P,Li)nI(P,L). 

Proof. We prove the result for points, leaving that for lines as an exercise. Let 
P2 be the set of points in P incident to at most J lfp^ lines in L. Then 



2\P\ 

1 

~2\P\ 



I(P 2 , L) = Y J *{leL incident to p} < < 

peP 2 

Since 

I(P,L) = I(P 1 ,L) + I(P 2 ,L) 

it follows that 

as required. □ 

Armed with Lemma 14.51 we now build incrementally towards Lemma 14.41 We 
shall show how to find individual foci, and simultaneous pairs of foci, before 
finding four arranged in a sum-product configuration. The initial results on 
finding singleton or paired foci are adjustments of similar methods used in the 
proofs of Bourgain-Katz-Tao [B] and Helfgott-Rudnev [20]; they are applied in 



the proof of the more developed Lemma 14.41 

The following result enables us find individual foci. 

Lemma 4.6 (Finding individual foci). Let P be a set of points and L a set of 
lines, such that every point is incident to Q(K) lines in L. Then there exists 
Pi C P with \Pi\ w \P\ such that 



P p l\ > 



K 


2 


P\ 




L 





for each p e Pi. 
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Proof. Since every point in P is incident to 0(A) lines in L it is immediate that 

I(P,L) « K\P\. 

Let L\ be the set of lines in L incident to Q = & (^Lp) points in P so 

that by Lemma 14.51 it follows that 

I(P,Li)«I(P,L)«#|P|. 

Now let Pi be the set of points in P incident to Q ( ^jpf^ j = ^ (K) lines in L\. 
By Lemma 14.51 again, 

I{P X M)~I{PM)~K\P\. (4.1) 

Since P\ C P and Li C L, each point in Pi is incident to at most 0(A) lines in 
L 1 and so 

I(Pi,Lx) <K|Pi|. (4.2) 

Comparing (14. ip and ( 14. 2 p shows that that |Pi| ^> |P|, implying that |Pi| ~ |P| 
since Pi is a subset of P. 

Now, each p G Pi is incident to Q (A) lines in L\, and each of these lines is 
incident to (-^p) points in P. So 

\PpL\ > 1-PpLj | S> 



A 


2 


P| 




I 





for each p G Pi, as required. □ 

The following result now takes the ability to find singleton foci, and uses it 
to find them in simultaneous pairs. 

Lemma 4.7 (Finding paired foci). Let P and L be a set of points and lines 
respectively in a plane such that every point in P is incident to 0(A) lines in L. 
Then there exist pi,p% G P such that 

\P\K 4 

\PpxL H P P2 L\ > |^| 2 ■ 

Proof. By Lemma HTEI there exists Pi C P with |Pi| « |P| such that \P p l\ ^> ^TlT 
for each p G Pi. In particular there is a point pi G P such that 

|p Pi lI > 



A' 


21 


P\ 




L 
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Applying Lemma 14.61 again, this time to P Pi l and L, there is a point p 2 G P Pi l 
such that 



P Pl L H P P2 l\ > 



K 2 


P Pi l\ 




L 





> — > 



K 4 


P\ 




L 


2 



as required. □ 

Now with the ability to find foci both one and two at a time, it is time for a 
proof of Lemma 14.41 



Proof of Lemma \4.4\ It suffices to find Pi,P2,Ps,P4 G P with 

\P\K S 



such that P2,P3 and p^ are collinear along a line in L that is not incident to 
P\. There will then exist an 0(i^)-sum-product configuration of appropriate size 
since: 

• Each p G P is incident to Q(K) lines in L, and so is an 0(f^)-focus for 
P p l- This means that the first condition in the definition of an 0(K)-sum- 
product configuration is satisfied for any subset of P Pi l ^P P2 l ^P P3 l HP P4 l- 

• The second condition of a sum-product configuration is clearly satisfied by 
virtue of the arrangement of Pi,P2,P3,Pi- 

( f\P\K 8 \ 

• No line in L is incident to more than O I ( 1 , L 4 J J points in P, and so 

P P ilH P P2 l^ P P3 l^ Pp 4 L contains a subset of cardinality 6 j for which 

the third condition of an 0(i^)-sum-product configuration is satisfied. 

Observe first that by Lemma [4. 71 there exist points pi,P2 G P such that 

\P\K 4 

\P PlL nP P2L \^> l -j^. (4.3) 

For convenience, define 

Q = P PlL n P P2L . (4.4) 

The proof will be complete if we can find distinct P3,P4 G Q collinear with p 2 
along a line in L that is not incident to p\ such that 

\P\K 8 

\Q P3L nQ P4L \ > t-j—. (4.5) 
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Now since Q C P, every point in Q is incident to Q(K) lines in L. So by 
Lemma [4.61 there exists Q\ C Q with |Qi| ~ |Q| such that 

IQplI » ^ (4-6) 

for each p E Q±. 

Since j?2 is an 0(i^)-focus for P P2 l, and Q± C Q C P P2 l, it follows that P2 is 
an 0(if)-focus for Qi as well. Let J C L be the set of O(K) lines through p 2 
supporting Q 1 , so that 

I(Qi,J) = |Qi| 

and 

|J| <K. 

Let Ji be the set of I E J incident to at least (^jp- j = Q J points in 
Qi. Observe that |Ji| > 2. Indeed, Lemma [4.51 implies 

I(Q 1 ,J l )^I(Q 1 ,J)^\Q\. (4.7) 

But since Qi C P and J x C L, and it is a hypothesis that each line in L is 
incident to at most 

I PI K 4 \ 



|L| 

points in P, it follows that each line in J\ is incident to at most 0(|Q| 1_0 ^) 
points in Q. Hence 

I(Qi,Ji)<|Ji||Qr- o(1) . (4-8) 

Comparing (14. 7p and (14. 8p gives |Ji| ^> IQI *- 1 * 1 . So by appropriate choice of 
constants in the statement of the theorem | J\ \ > 2 as claimed. 

Since there are at least two lines in J 1; and they are all incident to p 2 , at least 
one of them is not incident to p\. Fix this line I* E Ji, which will be the base line 
of the sum-product configuration. Since I* is incident to p 2 but not pi, it suffices 
to establish that (14. 5 j) holds for some distinct ^3,^4 G Q D Z*. 

Because Z* G Ji we have 

IQinr|»M (4.9) 
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From (14. 6 p it follows that 



K 2 \Q\ 



peQini* 



(4.10) 



On the other hand, by Cauchy-Schwarz, 



£ \q p l\<\q\ 1/2 

peQini* 



z2 \Qpi L n Qpa,l\ j 

\P3,P4.&Ql<~\l* J 



1/2 



/ 



\Q\ 



1/2 



\Qpd + £ \Qp- a l^Qp,l\ 



P3,P4&Qini* 

P3j L PA 



v 1/2 
/ 



If the first summation on the right were to dominate then it would mean 



£ iq p l|«iqi. 



Comparing with (14.101) and applying (14. 9p would then yield \Q\K <C \L\. By 

' \L\3 



(14. 3 p and (14. 4p . this would mean K <C \pn/ B ; an d so by an appropriate choice of 



12,13/5 

constant contradict the hypothesis K ^> ' l/5 . 

Thus the second summation on the right dominates instead and so by (I4.10p . 



K 4 \Q\ \Qnh\ 



P3,P4,&QlCll* 



Hence there exist distinct £>3,P4 G Qi H I* C Q fi /* such that 



|Q P3 l n <5 P4 l| 3> 



k 4 |q| _ |p|k 



> 



as required. 



□ 



4.4 Bounding partial sum-products 

Between them, Section 14.21 and Section 14.31 show that the existence of too many 

incidences implies the existence of A, B C ¥ p and a large G C A x 5 for which 

g g 

the partial difference and ratio sets \A — B\ and |^4/-B| are both small relative to 
\A\ and \B\. 
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This offends our sum-product sensibilities, since Chapter [2] showed that one 
or other of a product set or sumset must always be large. However those results 
were concerned with complete sumsets and product sets, whereas here only partial 
sum-products are controlled. Fortunately, Chapter [3] has tools designed for this 
kind of situation, in the form of Balog-Szemeredi-Gowers type results that allow 
passage from partial to complete sets. 

The main results in this section are the following 'partial sum-product' re- 
sults for F p . They will be proved using a combination of BSG-type results and 



Rudnev's finite field sum-product estimate Theorem 12.21 

Lemma 4.8 (Partial sum-products in ¥ p , vl). Let A,BCF P and G C A x B . 
If\A\ < p 1/2 then 

\G\ 55 <\A\™\B\ 37 \A- B\ 28 \A G /B\ 8 . 

Lemma 4.9 (Partial sum-products in F p , v2). Let A,5 C F p and G C A x B . 
If\G\ < p^^Bl then 

|Gf 7 < \A\ M \B\ 45 \A-B\ 28 \A ( /B\ 16 . 

Because sumsets and product sets are being analysed simultaneously, care is 
required when using Balog-Szemeredi-Gowers type results to prove these lem- 
mata. Results from Chapter [3J yield the following preliminary result, which cap- 
tures all the additive and multiplicative properties that need to be controlled. 

Lemma 4.10 (BSG-type for sum-products). If A, B C F p and G C A x B then 
there exists A' C A with \A'\ ^> such that 

— I I \B\ 

1. \A' A'\ « 4|B|3 

2. \a'/a'\ « \AhrmBi 

3. E X {A)> f 2|A/|4 . 

\A/B\2\A\2\B\ 



Proof. Let e > be sufficiently small. By Lemma 13.81 there exist A\ C A and 
H C A 1 x A 1 with 

\M»y- 
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and 

|#|>(l-e)|Ai| 2 

such that every pair (ai, 02) G if has joint G-degree at least 2 |lpmi • ^ Lemma 
13.6^ applied once additively and once multiplicatively, 

, a H A , \A-B\ 2 \A\ 2 \B\ 



\A/B\*\Am 



G 

H I4/R|2I4I2 



\G\ 2 ■ 

Apply Lemma I3T^1 once additively and once multiplicatively to obtain A 2 , A 3 C Ai 
with 



such that 



|A 2 |,|^|>(l-v^)|Ai 



1, , , l^i -^il 2 |A-B| 4 |A|W 

1, , IAM1I 2 |A/s| 4 |A| 4 |s| 3 



l^il ^ 



Let A' = A2 fl A3. By the cardinalities of A 2 and A3, 

m > a- 2^)1^1 »H 

That A' satisfies properties 1 and 2 is immediate from the above difference and 
ratio set estimates. To show that property 3 is also satisfied, let 

H' = Hn {A' x A'). 

Since both H and A 1 x A 1 are of cardinality at least (1 — 2-v/e) 2 1^4! | 2 we have 

\H'\ > (l-Sv^lAxI 2 

and so 

\Hf \A^ \G\ 2 \Ai\ 



E X (A') > -L^j— » -L-£_ » 



\A r /A'\ \A 1 ~/A 1 \ \A- B\*\A\*\B\ 
which completes the proof. □ 
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We can now prove Lemma 14.81 and Lemma 14.91 

A' r A Txri+Vi I A'\ ^ 

\B\ 



Proof of Lemma \4-8\ Apply Lemma [4.101 to obtain A' C A with \A'\ ^> 'Mr such 



that 

|BPI 2 l-4/B| 2 

Since \A\ <C p 1 ^ 2 , the set A' C A is of cardinality at most O (p 1//2 ) as well, and 
the sum-product estimate Theorem 12.21 can be applied to obtain 



G r *~* |(7|35 

|5| 4 |A| 8 |A/5| 8 ' ' 



Rearranging gives 



\G\ A3 \A'\ 12 <\A- B\ 2S \A G /B\ 8 \A\ m \B\ 25 



and so 

| G |55 < A\™\Bf\A- B\ 2 *\A/B\ l 



as required. □ 

Proof of Lemma f^~P[ Apply Lemma 14.101 to obtain A' C A with \A'\ ^> n§ such 
that 

G 



lA , _ A ^ \A- B m-w 

Unlike in the proof of Lemma where multiplicative energy was involved, these 
properties are preserved when passing to subsets of A' and so we may assume 
that | A' | ps j-|r[. Since |G| <C p l / 2 \B\ the set A' is then of cardinality at most 
O (p 1//2 ) and so Theorem 12.21 can be applied to obtain 

\B\ 28 \A\ 16 \A/B\ ie Vl ' W 

\A-B\ 2S \A\ 2S \B\ 17 



< 



|G| 



31 



78 



4 ■ 5. Proving Theorems \4-l\ and \4-^\ 

Rearranging gives 

|Gf 7 < \a\ m \b\^\a-b\ 28 \a/b\ w 

as required. □ 



4.5 Proving Theorems 14.11 and 14.2 



This section uses the analysis from Sections 14. 2[ 14.31 and 14.41 to give proofs of 
Theorems O and H2J 

Proof of Theorem\Jl\ Suppose that I(P,L) > N^ 2 ~ e . The aim is to show that 
e > 6^-o(l). 

First, a standard argument enables overly rich lines to be discarded. Let Li 
be the set of lines in L incident to at most (iV 1 / 2-1-6 ) points in P. We have 

J(P, Li) « N 3/2 ~ e 

since if L + is the set of lines in L incident to at least CN 1 ^ 2+e points in P then 

I(P,L + ) = J2Y, 5 Pi 

ieL + pi,p2 

<7^rAnp,L) + \pn 



< 



2N 3/2- 



C 



and so by an appropriate choice of constant I(P,L + ) < J ^' L ^ . 

By a dyadic pigeonholing there is a subset Pi C P and an integer K with 

\P X \K > N 3/2 - e (4.11) 

such that every point in Pi is incident to Q(K) lines in L\. Note moreover that 

K > N 1/2 - e (4.12) 



since |Pi| < N. Applying Lemma [4.41 to Pi and L\, at least one of the following 
is true: 
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1. j\ |Pi|1/6 . 

( Yip ik 4 \ 1_0 W\ 

2. There is a line in Li incident to at least f2 I ( 1 J I points in P\. 

3. There is a line in L\ incident to at least Q (( ^j^k j J points in P 1 . 

4. There exists P 2 C with |P 2 | ~ and points Pi,P2,Ps,P4 in an O(i^)- 
sum-product configuration. 

The first three cases are quickly dispensed with. For the first, applying (14.111) 
and then (14.121) yields e > ^ — o(l), which is far better than required. For the 
second, since every line in L\ is incident to at most N l l 2+e points in P, it follows 
that e > jq — o(l) as well. The third case is like the second, but this time yields 



c >i- (l). 



The fourth case remains, and is the core of the argument. Apply Lemma [4.31 
to P2 to obtain A, B C ¥ p with 



\A\,\B\<^K 



and G C A x B with 

|G| 

such that 



l^ll 


K 8 




L 


4 



G G 

\A-B\,\A/B\4ZK. (4.13) 



Note that either K < p 1 ^ 2 or K > since if 



P 



\G\ 

G\ 



1 ' 2 <K< 



pl/2 



then \G\ > p which contradicts the fact that |G| < N < p. 

If K < p 1 ! 2 then \A\ <C p 1 ! 2 and an application of Lemma [4.81 gives 

|G| 55 < K 109 

which implies by the cardinality of G that 

| Pi |55^331 < | L |220_ 



80 



4-5. Proving Theorems \4-l\ and\4.2\ 



Applying ( 14. lip . (14.121) and the fact that \L\ < N gives after rearranging 

N l/2 < ^331, 

which implies e > — o(l) as required. 

On the other hand, suppose that K > Assume also that K > p x l 2 since 
otherwise we are done by the previous paragraph. Note that with a fixed G, the 
bounds in (14.131) are not affected by passing to supersets of A or B. So assume 
\B\ ~ K and thus 

|G| 

W\ 

Hence Lemma [4.91 is applicable, giving 



« p 1/2 . 



|Gf 7 < K 133 

and therefore 

|Pl| 67 i^ 403 < |L| 268 
Since K > p 1 / 2 > N 1 / 2 and \L\ < N this means 

\Pi\ e7 K 67 < N 100 . 

By (14. lip it falls out that e > ^ — o(l), which is far better than required. □ 
Proof of Theorem\4.^\ For / e L(P), write fi(l) for the number of points in P 



incident to /. It is clear, as per the proof of Beck's theorem in Chapter [T] that 

leL(P) 

By a dyadic pigeonholing there exists Li C L(P) and an integer k such that 
nil) for all / G Li and 

\Li\k 2 > \P\ 2 . (4.14) 
To prove the theorem it suffices to show that either 

k>\P\, 

in which case there are f2(|P|) collinear points, or 



£133 < |p 



66 
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in which case k 2 < |P|ifi and so |£(P)| > \Li\ > |P| 1+ i53 as required. 

Note that I(P,Li) « So by another dyadic pigeonholing there exists 

Pi C P and an integer K such that every point in P\ is incident to Q(K) lines 
in L x and 



IP 



2 



\Pl\K> |Li|fc>i-J-. (4.15) 
Since |Pi| < |P| we also have 

^>^- (4-16) 
Applying Lemma [4.41 to Pi and L 1; at least one of the following is true: 

1. J\ <^ | Pi |l/ 5 - 

( ( \ P \ K i\ i-oMX 

2. There is a line in Li incident to at least Q I ( |X7p~ ) ) P om ts in Pi. 

3. There is a line in Li incident to at least Q [ Ojl^z ) J points in P 1 . 

4. There exists P2 C Pi with |P 2 | ~ and points Pi,P2,P3,P4 in a K-sum- 
product configuration. 

As with Theorem I4.1[ the first three cases fall easily. In the first case (I4.15P 
and ( I4.16P give 

IPI 6 

L£_l < 1 t i3 

By f)4.14p this in turn implies that k < 1 which is far better than required. In 
the second case, since all lines in Li are incident to 0(fc) points in P and hence 
O(k) points in Pi, it follows that 



k > 



\Pi\K 
\Li\ 



4 



By (Q5]) and fl^TB]) this gives |Z-i| 2 £; 5 > |P| 5 and so fT4TT4|) implies k > |P|. The 
third case similarly yields > |P|. 

The fourth case is left. Apply Lemma [4.31 to to P2 to obtain A, B C F p with 
IPI < K and GCAxB with 



IGI 



\Pi\K* 
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such that 

G G 
\A-B\,\A/B\ <C K. 

As with the proof of Theorem I4.1[ either K < p 1 ^ 2 or K > If K < p 

then as per the last proof, Lemma 14.81 gives 

|P 1 | 55 K 331 «|L 1 | 220 . 

By fl^TBD and (ETTBD this gives 

|p|386 < |£|220^,331 

By f04"jl we then get k 109 < |P| 54 , which is better than the sufficient A; 133 < |P| 66 . 
On the other hand, if K > and K > p l l 2 then by Lemma 14.91 



IPI 67 ^ 403 < \L, 



268 



By fETTSD and (OS]l this gives 

|p|470 < l^-,^! 268 ^ 403 

By (SUD it follows that 

< |p|66 

as required. □ 



4.6 Further work 

An intrinsic proof in the plane? The work here, and the previous work of 
Bourgain-Katz-Tao and Helfgott-Rudnev uses finite field sum-product estimates 
as an animating force. The progress in this chapter comes down to finding a 
particularly efficient way of relating this to incidences by showing that a 'sum- 
product' configuration of points and lines must arise if there are too many in- 
cidences, but that such a configuration cannot exist since it would contradict 
known finite field growth results. 

Is it instead possible to construct a proof that lives entirely in the plane? Such 
an approach, if it worked, could yield better incidence results. Since the Elekes 
result (Theorem 12. ip in Chapter [2] shows that incidence bounds yield growth 
results, this could also drive forward the study of growth in finite fields. 
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There may be some cause for optimism. The setup of points and lines that 
we called a 'sum-product' configuration is not only interesting because of its 
relationship to sum-products. It is also of intrinsic interest on the plane, where it 
invites comparison with Desargues' theorem. This theorem, actually a defining 
property of the projective plane, says that two triangles of points are perspective 
to a point if and only if they are perspective to a line, the meaning of which is 
illustrated in Figure 14.31 below. 



Figure 4.3. An illustration of Desargues' theorem. The two shaded triangles 
are perspective to the red point, and are also perspective to the line carrying the black, 
green and yellow points. 

A sum-product configuration should imply the existence of many pairs of 
triangles perspective to the same point and perspective to the same line, at 
the same three places. Could pursuing the geometric implications of such an 
arrangement be a fruitful line of inquiry? 
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Expander functions 



Chapter [2] introduced expander functions as a type of growth result. Recall that 
for a field F, an n-variable expander is a function / : F n — > F such that for 
any subset A of F, maybe satisfying some nondegeneracy conditions, the set 
f(A) = {f(ax, . . . , a n )} is of cardinality at least | ^4 1 1 ~ ,_f5 with 5 > 0. This chapter 
sets new records in two variables over finite fields, and in three and four variables 
over real and complex numbers. 

An earlier version of the two-variable finite field work formed one half of a 
joint paper [26^ with Oliver Roche-Newton has been accepted for publication in 
the Journal of Combinatorial Theory Series A. 

An earlier version of the three and four-variable real and complex results has 
been submitted to Discrete and Computational Geometry, and a preprint [25] is 
available on the arXiv. 

5.1 Results 

This section describes the new theorems proved in the chapter. 
5.1.1 Two variables 

As mentioned in Chapter |2j the strongest known two variable expanders is the 
function f(a,b) = a + ab, which was first studied by Garaev and Shen [16]. They 

1 The paper is the union of two theorems proved independently by the two authors. 
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proved three results about the size of the set f(A), depending on the ambient 
field and the density of A within it: 

If A C F p with \A\ < p 1/2 then \f(A)\ > \A\ 1+ ^ (5.1) 
If A C F p with \A\ > p 2/3 then \f(A)\ > \ A\ 1/2 p 1/2 (5.2) 
If A C R is finite then > (5.3) 

Result (15. 2p is sharp but ( 15.11) and (15. 3p are not. The first new result of this 
chapter is the following improvement on (15. ip . 

Theorem 5.1. If AC¥ p and \A\ < p l/2 then 

\f(A)\>\A\^. 

Result ( 15. 3p . has also been recently improved, in the same paper as that 
containing an earlier version of Theorem 15. 1[ to > 

5.1.2 Three and four variables 

As mentioned in Chapter [21 the breakthrough result of Guth and Katz [18] on 
the distinct distances problem yields the function 

(a - b) 2 + (c - d) 2 

as a four- variable expander over R with 8 > 1 — o(l). Iosevich, Roche- Newton 
and Rudnev [2T] used the same methodology to show that 

ad — be 

is likewise a four- variable expander with 8 > 1 — o(l). Both this and the Guth- 
Katz result are sharp, as shown by the case where A is an arithmetic progression. 

The next two new results in this chapter are examples of functions with 
stronger growth properties. The first is a function in only three real variables 
rather than four which is nevertheless an expander with 5 > 1 — o(l). 

Theorem 5.2. Let g(a, 6, c) = For any finite iCCwe have 

\g(A)\>\A\ 2 . 
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The second is a four- variable expander with 8 > 1 instead of 1 — o(l). 
Theorem 5.3. Lei /i(a, 6, c, <i) = fez^fezgj ■ F° r an U finite A C R we /iat>e 

> |A| 2 . 

As with the results of Guth and Katz and Iosevich, Roche-Newton and Rud- 
nev, the example of an arithmetic progression shows that Theorem 15. 21 is sharp up 
to logarithmic factors. However it is not clear that the same is true of Theorem 
15.31 in the case of an arithmetic progression one has > |^4| 3 and so there 

may well be scope for improvement. 

5.1.3 Structure 

In the rest of this chapter, Section 15.21 gives the proof of Theorem I5.1[ and 
Section 15.31 gives the proofs of Theorems 15.21 and 15.31 Section 15.41 suggests 
possible further work. 

5.2 Two- variable finite field expanders 

This section is concerned with proving Theorem 15.11 The overall strategy is 
similar to the finite field case in the work of Garaev and Shen [16] , exploiting the 
fact that 

f(A) = {a + ab : a,b e A} 

can be written as simply the product set A(A + 1). A sum-product philosophy 
suggests that at least one of A(A + 1) and the difference set A — A must be large. 
If it is the former then of course there is nothing to prove, so the idea is to deal 
with the latter case. That is, we want to show that if the difference set is large 
then so is A(A + 1). 

The innovation here is to take a more efficient route than [16], via the simple 
Ruzsa-type observation that if ab = cd then 

(a + ab) — (c + cd) = a — c. 

Section 15.2.11 establishes the new key result and shows three different ways 
of implementing it. Section I5.2.2l then deploys these in a modified sum-product 
type proof to establish Theorem 15.11 
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5.2.1 Bounding partial sumsets 

The following result is the key to our progress. 

Lemma 5.4. Let A,BC ¥ p , and let e > 0. There exists G C A x B with 
\G\ > (1 - e)\A\\B\ such that 

lA o |A(B + l)||g(A+l)||A/g| 

Proof. Without loss of generality assume that ^ A, B. Note that 

\AC\xB\ = \A\\B\. 

xeA/B 

Let X be the set of x e A/B for which \A n xS| > ^Jjl . Then 



xeX 



xex 

and so 

\Af]xB\ > (l-e)\A\\B\. 

xex 

Let G C A x 5 be given by 

G = {(a,i)GAxB:^l} 

so that 

|G| = \ A ^xB\ > (l-e)\A\\B\ 

xex 

G 

For each £ e A — B pick ag G A and & B such that 



lAnrrfil - 


►El 




x£X 


lAnrrfil - 


f e|A|] 



a 5 - 6 S = f . 



G 

Let S C (A — B) x A x B he given by 



5.2. Two-variable finite field expanders 



Note that 

\s\ » 

lbl >>e \A/B\ 

since there are \A — B\ choices of £, each of which has at least i^yffi associated 
pairs (c, d). We now show that 

|5| < \A(B + 1)\\B(A + 1)\. 

This will follow after showing that the map 

ip : S-> A(B + 1) x B(A + 1) 

-0(£, c, d) = (og + a^d, + b^c) 

is an injection, i.e. that for given (ti,t 2 ) in ip(S) there is only one choice of 
(£, c, d) G S 1 for which ip(£,c,d) = (ti,^)- Indeed for given (ti, t 2 ) in V'(S') it is 
immediate that 

£ = a e - fe £ 

= (a^ + a^d) — (&£ + 6^c) 
= *i - *2 

so we know £ and therefore ag and &£. We therefore also know (c, d) since 

ti = + a^d 
*2 = h + 

So if) is indeed an injection and the upper bound on \S\ holds. 
Comparing the upper and lower bounds on \S\ shows that 



\A\\B\\A-B\ 

Wb\ 

and so 



< £ \S\ < \A{B + 1)\\B{A+1) 



u G \A(B + 1)\\B(A + 1)\\A/B\ 

as required. □ 

Let's now apply this, using sumset calculus from Chapter [3j to give upper 
bounds on \A — A\ in terms of \A(A + 1)|. First, a Balog-Szemeredi-Gowers 
approach yields a quite-efficient relationship: 
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Corollary 5.5. For any set A C F p there exists A' C A with \A'\ w \A\ such that 

Proof. Apply Lemma 15.41 with A = B and some fixed sufficiently small e > to 
find G C A x A with \G\ > (1 - e)|A| 2 such that 

M g ,| , JA(A + 1)| 2 |A/A1 

|A - A| <C — . 



By Lemma [3.41 there exists A' C A with ~ \A\ such that 

„„ \A-A\ 2 \A(A + l)\*\A/A\ 2 
\ A \ \ A \ 

By Lemma I3TT1 applied multiplicatively, \A/A\ < ^^j 1 ^ and so the result fol- 
lows. □ 

Corollary 15. 51 is useful by itself, but we can do better in some places. Covering 
results are often used in sum-product proofs, and applying Lemma[3]T4]to Lemma 
15.41 yields one that will be helpful here. 

Corollary 5.6. Let A,B,C C F p; and A,B C xC + y for some x G F*,y e F p . 
Let < e < yg. T/ien (1 — e)\A\ elements of A can be covered by 

f\c{c+mc/c\\ 

V l^ll^l 2 / 

translates of B. Similarly, (1 — e)\A\ elements of A can be covered by this many 
translates of —B. 

Proof. Applying Lemma 15.41 to the sets A xy = C C and B xy = C C 
there exists G xy C x of cardinality at least ^1 — ^ such that 

\A xy (B xy + 1)| \B xy (A xy + l)\\A xy /B xy \ 



< 



141*1 

|C(C + 1)| 2 |C/C| 
\A\\B\ 



Then let 



to obtain 



|^ B |H^-'^I«^ (C + 1)|2|C/C| 



The result follows by applying Lemma [3. 141 to G, with e replaced by e 2 /4. □ 
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Applying Lemma [3.151 yields another helpful covering result. 

Corollary 5.7. LetO < e < \. Then there exists G C Ax A with \G\ > (l-e)\A\ 
such that A — A is covered by O e A ^ A+ ^ 3 ^ A ^ A ^ translates of A. 



Proof. By Lemma 15. 4[ with e replaced by |, there exists G 1 C A x A with 
\G'\ > (1 - f)|A| 2 such that 

.g> \A(A+l)\ 2 \A/A\ 

\A - A\ < e — ^ ■ 

Then by Lemma 13.151 there exists G C G' with 

|G|> (l-|)|G'|> (l-|)Vl 2 >(l-^)l^| 2 
such that A — A is covered by O e f-^^-^y^^^ translates of A, as required. □ 



5.2.2 Proving Theorem 15.1 



This section uses Corollaries 15. 5[ 15.61 and 15.71 to prove Theorem 15.11 

By Corollary 15.51 and passing to a subset of A if necessary we may assume 
that 

lA - A[< WffiT. (5 . 4) 
By Corollary 13.31 and again passing to a subset if necessary we may assume that 

\A-A-A-A\ < 1 , AV0 1 . (5.5) 

\Ar 



Now, by Corollary 13.101 and Lemma I3.11[ 



\A\ 



\a(A+l)nb(A + l)\ = E x (A,A+l)> {A{A+r 



a,b£A 

So there exists b G A such that 

J2 HA + 1) n b„{A + 1)| > j^j^j-yyj- 
By dyadic pigeonholing there exists A\ C A and iVeN such that 



|a(A+l)n6o(A + l)| ~ iV 
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for all a G A\ and 



\A 



3 



Since \Ai\ < \A\ this also implies 



\A 



2 



N * WXTW\- (5 ' 7) 



Now consider the set 



R{A X ) = <f- — j : 01,0,1,6 G A x ,a^ 0,1 ^ 6 
{ 7 - 6 

and break into two cases according to whether or not R(Ax) = ¥ p . 
5.2.2.1 R(Ax) ^ ¥ p 

The important thing about R(Ai) is that if £ ^ -R(Al) then 

|Ai + £Ai| = |G| (5.8) 
for any G C x ij. Note that (15. 8p employs a slight abuse of notation: by 

G 

Ai + £Ai it means the set of a + £6 for which (a, 6) 6 C It holds because there 
can be no repetition in A\ + £Ax, since if 

a + £6 = c + £d 

with (a, 6) (c, d) then £ = ffj G R(Ax) which is a contradiction. 

Now since R{A X ) ^ ¥ p there must exist £ = 2=| <= i2(A x ) such that £ - 1 ^ 
i?(y4i). For any G C Ai x Ai it follows that 



G 



\G\ = \A 1 + ^-l)A 



. g a - 13-1 + 5 . 
At + — j Ax 

1~o 

= \(i-6)Ax + (a-0-i + 6)Ax\ (5.9) 

where there is another abuse of notation in (15.91) . 

Now proceed to fix a particular choice of G. Let e > be sufficiently small 
and for convenience write 

_ \A(A + 1)\*\A/A\ 
N 2 \Ax\ 
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Applying Corollary 15.61 to the sets a{A\ + 1) and b$(Ai + 1) D a{A\ + 1) shows 
that there is a set A a C A\ with 

> (1- e)|Ai| 

such that aA a is contained in the union of O e (A) translates of b$A. Similarly, 
there are sets Ap, A y , As C Ai with 

IM \Ajl\A s \ > (l-e)|Ai| 

such that (3Ap and 7 Ay are contained in the union of O (A) translates of boA, an d 
5 As is contained in the union of O (A) translates of — b$A. Let 

A 2 = A a n Ap n A~, n A s 

so that 

|A 2 | > (l-4e)|A x |. 
By Corollary 15.71 there exists G C A 2 x yl 2 with 

\G\^\A 2 \ 2 ^\A X \ 2 

such that A 2 — A 2 is covered by O ( ^^"^p j translates of A 2 . Fix this choice 
of G so that from f|5.9|) . 



|A| 2 « 



( 7 -5)(A 2 -A 2 ) + (a-/3)A 2 



|( 7 - ^)A 2 - (a - gMj^i + 1)| 2 |AM| 
|A|3 

aA 2 - (3A 2 - 1 A 2 + 5A 2 \\A(A + 1)\ 2 \A/A\ 



< 



\A\ 3 



Since 



A 2 C v4 Q , Ap, Ay, As, 

and aA a , PAp^Ay are each contained in the union of 0(A) translates of b A, 
and S As is contained in the union of O (A) translates of — boA, it follows that 

|Ai|2 c , |^ a -/3^- 7 A 7 + ^||A(A + l)| 2 |AM| 

A 4 l6nA-6nA-6n^l 



A 4 


6 ^4 - 6 ^4 - 6 ^4 


-b A\\A(A + l)\ 2 \A/A\ 




- A — A — A\\A(A 4 


3 

1)\ 10 \A/A\ 5 


A^ 8 L4i 


7 
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By (I5.5p . this gives 



\A-A\ 


*\A{A- 


f 


1) 


\ 10 \A/A\ 5 


N*\A X \ 


71 




2 



Now by the Ruzsa triangle inequality (Lemma 13. ip applied multiplicatively, 

WAl £ (5 . 12) 



Applying Q and (157L2]) to floTTTD yields 

|^i| 2 « 



A 



2 _ |A(A+1)' 4; 



A^ 8 |Ai| 7 |yl| 28 
Rearranging and applying ( 15. 6 p and (15. 7p gives 

\A(A+ 1)| 44 » \A^N 8 \A\ 28 > , f > " l| 



~ |A(A + 1)| 



9 



and so 

+ 1)| > |A| 54 / 53 

as required. 

5.2.2.2 = F p 

Let E be the number of solutions to 

a + £b = c + £d (5.13) 

with a,b,c,d G Ai and £ E i?(Ax). Moreover, for each £ e -R(Ai) recall that 
£Ai) is the additive energy of Ai and i.e. the number of solutions 
to (I5.13P with £ fixed, so that 

E= E + {A u iA,). 

There are no more than | J-2(^4i ) 1 1 v4i | 2 = p|Ai| 2 solutions to (I5.13P for which 
(a, b) = (c,d). And there are no more than | | 4 solutions with (a, b) ^ {c,d). 
So in total 

J2 E + (A 1 ,ZA 1 ) = E<\A 1 \* + p\A 1 \ 2 . 
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Since \A\\ < \A\ < p 1 ^ 2 this gives 

E+iA^tAj^plA 

So there exists £ = 2=| g R(Ai) such that 
Moreover, for any A 2 C Ai we have also 
Now by Lemma I3.11[ 



\2 



\A 14 

£ + (A 2 ,£A 2 )> 



2 



\A 2 -£A 2 \ 



and so if |^4 2 | ~ \ Ai\ then 

|aA 2 -/3A 2 -7A 2 + 5A 2 | > 



A ~ P A 

A 2 y^ 2 

7 — 



2 



» |A| 2 . 

Let A a , Ap, A 7 , As be as before and fix 

A 2 = A a f] Afs f] Ay f] A s . 

This yields the same situation as considered at (15. lUj) in the R{A\) 7^ F p case, 
but with one less factor of ^^T^lg^^ to deal with. So we obtain (and in fact 
exceed) the required bound. □ 

5.3 Three and four- variable real and complex 
expanders 



This section is concerned with the proofs of Theorems 15.21 and 15.31 

Let's first place these in the context of the Guth-Katz proof on distinct dis- 
tances, which led to the example of the function (a— b) 2 + (c— d) 2 as a four- variable 
expander. This was based on a framework of Elekes and Sharir [TTJ. The idea, 
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when counting the number of distinct objects determined by a set, is to analyse 
functions under which that object is invariant. 

In the Guth-Katz proof, where the objective is to count distances, the ap- 
proach is to analyse the group SE 2 of orientation-preserving isometries. This 
analysis can be parameterised as an incidence problem of points and lines in M 3 . 
The usual Szmeredi- Trotter theorem is too weak to be of help directly, but Guth 
and Katz were able to amplify it to Theorem 11.91 which they developed using a 
novel 'polynomial partitioning' technique specifically for this purpose. 

The approach of Iosevich, Roche-Newton and Rudnev for showing that ad — be 
is a four-variable expander follows the same approach, but uses the fact that 
this function can be viewed as a determinant. Thus the approach is to analyse 
determinant-preserving maps, i.e. elements of SL 2 . Like the Guth-Katz result, 
this too required an application of Theorem 11.91 

The approach for proving Theorems 15.21 and 15.31 is likewise founded on the 
Elekes-Sharir paradigm. Whereas Guth and Katz counted distances by analysing 
isometries from the group SE 2 , and Iosevich, Roche- Newton and Rudnev counted 
determinants by analysing SL 2 , the functions g and h considered here are in- 
stances of cross ratios, which are preserved by the group PSL 2 of projective 
transformations of the line. 

There are several advantages to working with cross ratios: 

• As shown in Section I5.1[ we are able to prove quantitatively stronger ex- 
pander results. 

• A smaller arsenal is required. Neither Theorem 11.91 nor any other applica- 
tion of the Guth-Katz polynomial partitioning technique is required. In- 
stead only the Szemeredi- Trotter theorem on points and lines is used when 
proving Theorem 15.21 and only the Edelsbrunner-Guibas-Sharir theorem on 
points and planes is used when proving Theorem 15.31 

• Parameterising as an incidence problem is more straightforward. The Guth 
and Katz and Iosevich, Roche-Newton and Rudnev proofs go through a 
certain amount of hassle in order to make the parameterisation and verify 
that appropriate nondegeneracy conditions are satisfied. But with cross 
ratios and projective transformations everything falls out naturally. 
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In what follows, Section 15.3.11 describes the standard theory of cross ratios, 
establishing that they are invariants of projective transformations. Section l5.3.2l 
then identifies projective transformations in a natural way with points in three- 
dimensional projective space, and establishes how the transformations' behaviour 
corresponds to line and plane structures of points. Section 15.3.31 then uses the 
results of the preceding two sections to prove Theorems 15.21 and 15.31 



5.3.1 Cross ratios 

This section records some standard theory on cross ratios. 

Recall, or consult Appendix [Bj that for a field F the projective line FF 1 is 
the set of equivalence classes [x,y] of F 2 \ {(0,0)} modulo dilation, and that it 
can more concretely be viewed as the extended line F = FU{oo}by identifying 
[1, 0] with oo and [x, 1] with x G F. Under this identification, the cross ratio 
X(a, b, c,d) G F of four elements a,b,c,d G F is given by 

X(a, b, C, d) = -T r- -jr 

(b — c)(a — d) 

interpreted in the sense of limits where necessary. Note that the functions g and 



h from Theorems 15.21 and 15.31 are given by 

g(a, b,c) — X (oo, a, b, c) 
h(a, b, c,d) = X (a, b, c, d) 

So to prove the theorems it suffices respectively to show that 

# {X(oo, a, b, c) : a, 6, c G A} > |A| 2 ~ o(1) for any finite A C C 
# {X(a, b, c, d) : a,b,c,d G A} > \A\ 2 for any finite ACl, 

The importance of the cross ratio is that it is a projective invariant of quadru- 
ples, in the sense of the following result which can be found in [39], for example. 

Lemma 5.8. Let Oj G F be distinct for i = 1,2,3,4 and the same for 6j G 
F. Then X(a\, a,2, 03, 04) = X(b\, &2> &3j 64) if and only if there is a projective 
transformation in PSL2(F) that sends each ai to bi. 

Proof. By Lemma [B. II there is a unique projective transformation \x G PSL2(F) 
that sends at to bi for i — 1, 2, 3. We shall show that X(a\, 02, 03, 04) = X(pi, 62, 63, 64) 
if and only if \i also sends 04 to 64. 
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T~abc 



First note that X(a,b,c,d) = r abc (d) where r abc G PSL 2 (F) is the unique 
projective transformation that sends (a, 6, c) to (oo, 1,0). To see this it suffices 
simply to check that 

b — a (a — b)c 
c — b (b — c)a 

and then that r abc (d) = X(a, b, c, d). 
It follows that 

X(a 1 , a 2 , a 3 , a 4 ) = X(bi, b 2 , b 3 , 6 4 ) 

if and only if 

T ai a 2 a 3 (a A ) = T blb2b3 (b 4 ). (5-14) 

Now by definition of \x we have 



Taxaia?, ^b\b2b 3 ° 



and so by injectivity of r blb2b3 , equation (15. 14|) holds precisely when /i(a 4 ) = 6 4 . 
This completes the proof. □ 



5.3.2 Points, planes and transformations 

This section contains two results that apply to any field F. The first, a 'points 
lemma', identifies projective transformations from PSL 2 {F) with points in FF 3 . 
The second, a 'planes lemma', establishes that the behaviour of transformations 
corresponds to line and plane structures of their associated points. 

Lemma 5.9 (Points lemma). Define ip : PSL 2 (F) — >■ FF 3 by 



f p 


:)] 


I r 





= \p,q,r,s]. 



The map ip is well-defined and injective, and its image is FF 3 \ Q where Q is the 
quadratic surface given by ps = qr. 

Proof. That ip is well-defined and injective follows from checking that if ti,t 2 G 
SL 2 (F) then ijj[tx] = ip[t 2 ] if and only if t t = ±t 2 . That the image is PF 3 \ Q 
follows from the definition PSL 2 (F) = SL 2 (F) /±I. □ 

Lemma 5.10 (Planes lemma). Let ip be as in the points lemma. For each (a, b) 6 
F x F there is a plane n ab C FF 3 with the following properties. 
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1. If t E PSL 2 (F) then r(a) = b if and only if ip{r) G n ab . 

2. No three planes are collinear. 

3. Different pairs (a,b) G F x F determine different planes ix ab . 

4- Different pairs of planes {TT ab , 7r Cf j} intersect in different lines 7r a & fl tt C( i. 

5. For any A C F, a point p G FF 3 \Q is incident to at most \ A\ of the planes 
from {n ab : a,b G A}. 

Proof. From the theory in Appendix[Bj a projective transformation r = 
sends a to b if and only if 

ap + q 

— T~ = b > 
ar + s 

which is the same as 

ap + q — bar — bs = 0. 

For fixed a, b this is an homogeneous linear constraint on ip{r) = [p, q, r, s] G PF 3 
and so describes a plane in FF 3 , which we define to be 7c a b. Property 1 is satisfied 
by construction, and it is now straightforward to establish properties 2 to 5 in 
turn 

2. It suffices to show that three planes intersect in a point. Let (a, b, c) and 
(d, e, f) be two triples of distinct elements of F. By Lemma [B. II there is a 
unique r G PSL 2 (F) that sends (a, b, c) to (d,e,f). So 

VTad H n be fl 7T c/ = ^(t) 

which is a single point in PF 3 . 

3. If itab = ^cd for some (a, 6) 7^ (c, c?) then n a b fl 7r c d fl n e f is either a line or a 
plane for any third pair (e, /), which contradicts property 2. 

4. Suppose that 

T'ab l~l 7T cd = 7T a /f/ fl 7T c / d /. 

Then 

But by property 2 the set on the left hand side is a point, whereas that 
on the right is a line, unless 7i ab G {7T a 'b', ^&d'}- Similarly a contradiction 
follows unless -n cd G {^ a 'b' ^dd'} 
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5. Let p be a point in FF 3 \ Q, so that p = t/>(t) for some r G PSL 2 (F). For 
each a G A there is at most one b & A for which p is incident to 7r a b, as 
otherwise r(a) would take two different values. Counting over all a G A 
shows that p is incident to at most \A\ planes. 

□ 

5.3.3 Proving Theorems 15.21 and 15.31 

This section uses the results from Sections 15.3. II and 15.3.21 to prove Theorems 15.21 
and !5.3l We first give the proof of Theorem l5.21 which uses the Szemeredi- Trotter 
theorem. 

Proof of Theorem I5.H Since 

\g(A) \ = # {X(oo, a, b, c) : a,b, c G A} , 

we want to show that 

# {X(oo, a, 6, c) : a, 6, c G A} > |A| 2 " o(1) 

for any finite A C C. To this end write E(A) for the number of solutions to the 
equation 

X(oo, ai, a 2 , a 3 ) = X(oo, b ± , b 2 , b 3 ) (5.15) 

with each of the a« and bi in A. Write fj,(x) for the number of ai, a 2 , a 3 G A with 
X(oo, a,i, a 2 , a 3 ) = x. Then 

xeg(A) 

and Cauchy-Schwarz implies that 

w 6 « ( E ^)) <i^)i^)- 

Ve 9 (A) / 

So it suffices to show 
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By Lemma 15. S\ equation ( 15.1 5p is satisfied precisely when there exists r G 
PSL 2 { < C) that fixes oo and sends each a« to bi. So if we define 

T = |J {r : r(oo) = oo, r(a) = b} 

a,b&A 

and write N(t) for the number of (a, 6) G A 2 for which r(a) = 6, then 
£(A)= ^ 1 Woo, ai. as, as) = ^(oo, 6i, 62, 6 3 )) 

ai,a2,a3SA bi, b2,b3&A 

< 1 (r(aj) = foj for each i) 

a\,ai,az^A bi ,b 2 ,b 3 eA t6T 

<^iV(r) 3 . (5.16) 

Let ip be as in the points lemma. Define a set P of points by 

P = V>(T) 

and a set of L lines by 

£ = {7Ta6 n 7TOOOO : a, 6 G A} 

so that |L| rs |A| 2 . The points and lines all lie in the plane Tr^oo. Moreover, if 
we write m(p) for the number of lines from L incident to a point p G P, then 

N(T) = #{(a,b):r(a) = b} 

= # {(a, 6) : ^(t) G 7T a6 n TToooo} 

= m(V»(r)). (5.17) 
Combining (15. 16p and (15.171) gives 

P(A)<^mQo) 3 . ( 5 - 18 ) 

For each j G N write P., for the set of p G P with m(p) G [2 J , 2- ? ~ 1 ). Applying 
the complex Szemeredi- Trotter theorem in the form of Corollary 11.151 gives 

\L\ 2 \L\ 
101 



and so 



Chapter 5. Expander functions 



log \A\ 

E(A) « J2 \ p ^ 3j 

3=0 

log|A| /i 7-12 in 

3=0 V 

« U| 4 log \A\ 



as required. □ 



We now give the proof of Theorem 15. 3[ which uses the Edelsbrunner-Guibas- 
Sharir theorem. 

Proof of Theorem \5.3[ This time we want to show that 

\h(A)\ = #{X(ai, 02, o 3 ,o 4 ) : en G A} > |A| 2 

for any finite ACM. To this end, we this time write E(A) for the number of 
solutions to the equation 

X(at, o 2 , o 3 , 04) = X(bt, b 2 , h, b A ) (5.19) 



with the Oj, 6j G A. Using Cauchy-Schwarz as in Theorem 15.21 shows that 

HA)\ » Jg 

so it suffices to show that 

£(4) < |A| 6 . 

Equation (15.19P is satisfied precisely when there exists r G PSLzi^) that 
sends a, to bi for each z. Define 



T = |J {r : r(o) = 6} 



a,beA 



and write iV(r) for the number of (a, 6) G A 2 for which r(a) = 6. Then 

E(A)«^iV(r) 4 

r6T 
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Let ip be as in the points lemma. Define a set P C PR 3 of points by 

P = V(T) 

and a set IT of planes by 

II = {7r af) : a, b G A} 

so that |I1| ps |y4| 2 . If we write m(p) for the number of planes from II incident to 
a point p, then 

N(t) = m(ip(r)). 

So, following the same argument as for (I5.18P in the preceding proof, 

E{A) < ^m(p) 4 . 
peP 

For each j 6 N write Pj for the set of p G P with m(p) G [2\ 2 J+1 ). Then 
applying the Edelsbrunner-Guiber-Sharir theorem on points and planes in the 
form of Corollary 11.111 gives 

log |A| 

E(A) « J2 \ P P 4j 

3=0 




as required. □ 

5.4 Further work 

• Where else could Ruzsa-type observations lead? The Ruzsa-type 
observation that 

(a + ab) — (c + cd) = a — c 

whenever ab = cd leads to a doubling of the growth exponent for the func- 
tion a + ab over finite fields. Could similar observations enable the con- 
struction of other expander functions, or give improved bounds on existing 
ones like a + t, a + b 2 , or a 2 + abl 
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• What more can be proved using cross ratios? Cross ratios turned 
out to be very useful when constructing expanders in Theorems I5.2l ancl l5.3i 
For example, Theorem 15.21 is a sharp three- variable result whose bound of 
| g(A) | > |v4| 2 is as strong as the best previously known four-variable results. 
What more can be said? 

As a start, it seems reasonable to conjecture that the four-variable Theorem 
15.31 can be strengthened to > \A\ 3 . This would bring the theorem 

into line with the case where A is an arithmetic progression, which is a 
sharp example for Theorem 15. 2[ 

In terms of applications to other growth results, might the strength of cross- 
ratio estimates enable new things to be said about two- variable expanders, 
or about sum-product estimates? 
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A sum-product theorem in function fields 



This chapter proves a sum-product theorem in the function field ¥ q (t), showing 
that if A is a finite subset of ¥ q (t) then 

max{\A + A\,\AA\} > ? |A| 1+ ^ o(1) . 

The exponent of | — o(l) lies between the | — o(l) known in the complex 
setting and the -jj — o(l) known in the finite field setting. This reflects the fact 
that unlike finite fields, ¥ q (t) has an associated norm and topology, but that this 
norm is very different from that on C, with an unusually rigid 'non-archimedean' 
geometry. 

The material in this chapter is joint work with Thomas Bloom. 



6.1 Results 

Recall that ¥ q denotes the finite field of order q, where q = p a is a prime power. 
The prime p is the characteristic of F q , i.e. the least n such that 

# + + £ = 

n 

for all x E ¥ q . Throughought this chapter the letter p will be reserved for this 
characteristic. The function field ¥ q (t) is the field of rational functions of a 
transcendental element t over F„. Elements are therefore of the form 



N 

X = 

jr = -00 
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where the Xj are elements of ¥ q . Note that this means that although ¥ q is finite, 
the function field ¥ q (t) is not. 

As with finite fields, it is necessary to rule out the possibility of finite subfields 
to be able to say anything non-trivial about growth in ¥ q (t). But unlike finite 
fields, this is accomplished by a minimum rather than a maximum condition on 
the cardinality of a set A. The reason is that the only finite subfields of ¥ q (t) 
are ¥ q and its subfields: to rule these out it suffices to insist that A is a bit 
bigger than q. Think therefore of q as being small and fixed, as opposed to finite 
fields where it is taken to be very large indeed. A convenient way of capturing 
this necessity is to introduce an implicit dependency on q when formulating sum- 
product estimates, so that results are of the form 

max{\A + A\,\AA\} > g \A\ 1+S 

for an absolute 5 > and any finite A C ¥ q (t). 

As mentioned in Chapter [2j Li and Roche-Newton [33] obtained a sum- 
product estimate 5 > -j — o(l) for a finite field ¥ q whose order is not necessarily 
prime, extending the applicability of Rudnev's result for ¥ p . Because of its com- 
binatorial generality this proof should go through in the function field setting 
without additional complication. 

It is possible to do better by developing techniques specific to function fields. 
This chapter proves the following theorem 

Theorem 6.1. If AC ¥ q (t) is finite then 

\A + A\ 3 \AA\ 2 > g \A\ 6 -° {1 \ 

A sum-product result for function fields with 5 > | — o(l) follows immediately. 

Corollary 6.2. If A C ¥ g (t) is finite then 

max{\A + A\, \AA\} > g |A| 1+ ^ o(1) . 

The next section provides more background on function fields, and explains 
the structure of the proof and the rest of the chapter. 
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6.2 Function fields 

This section has two parts. The first part, Section 16.2.14 gives some standard 
background on the geometry of function fields. The second part, Section 16.2.21 
explains how this will be used in the proof of Theorem 16.11 and describes how the 
rest of the chapter is organised. 

6.2.1 Background 

The field ¥ q (t) has a norm or valuation | • | given by 



where deg(x) is the degree of x, i.e. the maximal j for which Xj is non-zero. 
This valuation has the non-archimedean property that 



which is stronger than the usual triangle inequality. As a consequence ¥ q (t) has 
an unusually rigid geometry, which will be exploited when proving sum-product 
estimates. A particular concern will be the behaviour of balls, which are as usual 
sets of the form 



In ¥ q (t), the non-archimedean property implies the following fact, which is con- 
sidered to be standard. 

Lemma 6.3. If B\ and B 2 are balls in ¥ q (t) then either they are disjoint, or 
B\ C B2, or B2 C B\. If in addition B\ and B2 have the same radius then either 
they are disjoint or Bi = B 2 . 

Proof. Let B\ = B(x,r) and B 2 = B(y, s). If there exists a G B(x,r) D B(y, s) 



\x — y\ < max {\a — x\,\a — y\} < max {r, s} . 
If r < s then this implies B(x, r) C B(y, s) since if b e B(x, r) then 

\y — b\ < max {\y — x\,\b — x\} < max {r, s} = s. 
Conversely if s < r then B(y,s) C B(x,r). Hence if r = s then B(x,r) = 




x + y\ < max {\x\, \y\} 



B(x, r) = {y G ¥ g (t) : \x — y\ < r} . 



then 



B(y,s). 



□ 
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6.2.2 Sum-product considerations 

The ¥ q (t) sum-product proof builds upon an approach of Solymosi [JT] for sum- 
products in C. When adapting this method, the non-archimedean geometry of 
¥ q (t) turns out to be a mixed blessing. 

First, the bad news. Solymosi's argument fails at a critical point in the ¥ q (t) 
setting, for the following reason. For each a 6 A, let a' G A \ {a} be such that 
| a — a' | is minimal, and let B a be the ball of radius \a — a'\ centred on a. Solymosi's 
method uses the crucial fact that a single complex number can be contained in at 
most 0(1) of the B a . This fails spectacularly in ¥ q (t): in this setting an element 
could be contained in as many as \A\ of the B a , as demonstrated by the example 

A = {t j : < j < n} 

where 

Bp = {xe¥ q {t) : |ac| < q 3 } 

for j > 1 and B\ = B t , meaning that every one of the \A\ balls contains as an 
element. 

But all is not lost. In the example above, the astute reader will notice that 
\A + A\ ~ \A\ 2 , and so a strong-sum product estimate holds despite the failure 
of Solymosi's argument. In fact we will be able to show that something like this 
is possible whenever the Solymosi argument fails, by defining and considering 
separable sets. 

Say that a set A is separable if its elements can be indexed as 

A = {ai, . . .,a\ A \} 

in such a way that for any 1 < j < \A\ there is a ball Bj with 

A n Bj = {ai, . . . , ctj} . 

Separability is fairly unexciting in the complex setting, but in the non-archimedean 
regime of W g (t) it is a stronger notion. The rigid geometry makes it harder to 
find separable sets, but where they do exist it will in fact imply the existence of 
large sumsets. The idea, therefore, is to show that a large separable sets must 
exist whenever the Solymosi approach fails. Combining this with an analysis of 
separable sets as having large sumsets will lead to a proof of Theorem 16. 1[ 
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In what follows, Section 16.31 analyses separable sets and develops the nec- 
essary results about their sumsets. Section 16.41 then adapts Solymosi's proof 
from jJTJ to establish that if \A + A\ and \AA\ are both small then A must con- 
tain a large separable set. Section 16.51 uses these results to prove Theorem 16.11 
Section 16.61 considers some directions for further work. 



6.3 Separable sets 

This section analyses sumsets of separable sets. Recall that a set A C ¥ g (t) is 
separable if its elements can be indexed as 

A = {ai, . . .,a\ A \} 

in such a way that for each 1 < j < \A\ there is a ball Bj with 

A fl Bj = {ai, . . . , a.,} . 

Say that the balls Bj separate A. It is an immediate consequence of the defini- 
tion that a subset of a separable set is itself separable. 

As in Chapter [3] write kA for the /c-fold sumset of A. The following lemma 
shows that if A is separable then kA has essentially maximum-possible cardinality. 

Lemma 6.4. If A is separable then 

\kA\ » fc \A\ k 

for any natural number k. 

Proof. Let E^{A) denote the fc-fold additive energy of A, i.e. the number of 
solutions to 

ai + . . . + a k = h + . . . + b k (6.1) 

with the dj, bi e A. For x G kA write for the number of solutions to 

x = dx + . . . + dfe. By Cauchy-Schwarz as in Lemma [3.111 

\x£kA 



and so it suffices to show that Ek(A) <tik \A\ k , i.e. that there are at most Ok (|>l| fc ) 
solutions to (16. ip . 
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Say that a solution to (16.1 j) is trivial if at least 2k — 1 of the 2k terms occur 
with multiplicity at least 2. By elementary counting there are at most | A| fe ) 
trivial solutions, so it suffices to show that there are no non-trivial solutions. 

Suppose for a contradiction that a non-trivial solution to (16.1 p exists. Gath- 
ering terms gives an expression of the form 

n x c\ + . . . + n t c t = (6.2) 

where the Cj are distinct elements of A and, since we are in characteristic p, 
the rii are integers non-strictly between 1 and p — 1. The assumption of non- 
triviality implies that t > 2, since at least two of the terms have G {1, p — 1}. 
Additionally, note that 

ni + ■ ■ ■ + n t = (mod p). (6.3) 

Indeed after gathering terms on the left the different multiplicities must sum 
to zero, since there are the same number of terms on the left of (16. ip as on the 
right. Any q for which rii = (mod p) is discarded, meaning that the sum of 
the remaining multiplicities is (mod p) as well. 

Now since A is separable and the q are in A we may relabel them if necessary 
to assume the existence of a ball B(x, r) such that c\ B but C2, . . . , c t G B. By 

(E2D, 

\ci — x\ = \riiCi — n\x\ 

= \n 2 c 2 + . . . + n t c t + n\x\. 

Then by (16 .3p and the non-archimedean property it follows that 

|ci - x\ = \n 2 (c 2 — x) + ... + n t (c t - x)\ 

< max{|c 2 —x\,...,\ct — x\} 

< r 

and hence C\ G B(x,r) which is a contradiction. Thus there are no non-trivial 
solutions and the proof is complete. □ 

6.4 Finding many separable sets 

The goal in this section is to show that if the sumset and product set of a set A 
are both small then it must contain a large separable set. For this it adapts the 



110 



6-4- Finding many separable sets 



argument of Solymosi [JT] for complex sum-products discussed in Section 16.2.21 
Note that all of the analysis remains in the ¥ q (t) setting; indeed some of the facts 
of non-archimedean geometry deployed here are manifestly false in C. 
A couple of new definitions are required. Define 

r A (a) = min \ a — a'\ 

a'&A 
a'^a 

B A (a) = B(a,r A (a)). 



Additionally, say that C C A is an A-chain if its elements can be indexed as 
C = {ci, . . . , c n } in such a way that 

B A {c x ) C...C%). 

The following argument, a strengthened form of that found in [H], finds a large 
chain in A as long as the sumset and product set are both small. In the event 
that this condition were to fail there would of course be nothing to prove. 

Lemma 6.5. Any set A contains an A-chain of cardinality 

n 





\A\ l 




\A + A\ 2 \AA 


-| 2 log 3 |A| 



Proof. For each a G A write N(a) for the maximal cardinality N of an A-chain 
C = {ci, . . . , cat} for which cn = a. Note for future reference that 

N(a) < \B A (a)nA\ 

since if C is such an A-chain then C C A by definition and for each c G C we 
have c G B A (c) C B A (a). 

It suffices to find a G A such that 

|A| 5 



N(a) > 



\A + A\ 2 \AA\ 2 log 3 \A\ 



Begin with a dyadic pigeonholing. For each < j < log 2 \A\ define Aj to be the 
set of a G A for which 2 J < N(a) < 2 J " +1 . The Aj partition A and so 

log 2 \A\ 

E i^i = i^i- 

J=0 
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Hence there exists j for which \Aj\ ^> |v4|/log \A\. We shall show that 

2 j > 







5 


\A + A\ 2 \AA\ 2 hg 3 \ 


A\. 



To this end, say that a pair (a, c) G A x A is additively good if 

\(A + A)n(B A (a)+c)\ < 



and that (a, d) E A x A is multiplicatively good if 

\(AA)n(d-B A (a))\ < 



2 i+3j 


AA\ 









Say that a quadruple (a, 6, c, d) G A 4 is good if 

1. a G Ay 

2. be B A {a) n A. 

3. (a, c) is additively good. 

4. (a, g?) is multiplicatively good. 

Write Q for the number of good quadruples. We shall bound Q from below 
to obtain 

Q^2^\A j \\A\ 2 (6.4) 

and bound it from above to obtain 

„ 2 2 i\A + A\ 2 \AA\ 2 , . 

Q « ' |A 2 ' L - (6-5) 

Comparing (16.41) and (16. 5 j) will then give the required bound on 2 J since |A,| ^> 
log Let's first establish (16 .4p . For fixed c G A we have 

J2\(A + A)n(B A (a)+c)\ = Y, ^(ueB A {a)+c) 

a£Aj a£Aj u&A+A 

= E E 1 ^^)) 

veA+A-c aeAj 

= E 1^)1 

veA+A~c 

where Cj{y) is the set of a G Aj with v G B A (a). 
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Note that Cj(v) is an A-chain. This follows from Lemma [6.31 since for any 
two a, b G Cj{v) we have v G B A (a) H B A (b) and so either i? A (a) Q B A ib) or 
B A (6) C B A (a). 

Now since Cj(v) C Aj and Cj(v) is an Achain, there is an a G Aj for which 

< iV(a) < 2 j+1 . 

We therefore have 

J] \{A + A)n(B A (a) + c)\ <y +1 \A + A\ 



a£Aj 



and hence 



\{A + A)n{B A {a) + c)\ < 



2 i+3 






^■1 





holds for at least 3|A, |/4 elements a G Aj. So for fixed c G A there are at least 
3 1 A,- 1/4 elements a G Aj for which (a, c) is additively good. 

By the same argument we may show that for fixed d G >1\{0} there are at 
least 3 1 A,- 1/4 elements a G Aj for which (a,d) is multiplicatively good. 

Thus for any c G A and d G ^4\{0} there are at least \Aj\/2 elements a G Aj 
for which (a, c) is additively good and (a, d) is multiplicatively good, i.e. for 
which conditions 3 and 4 hold. Furthermore for each such a G Aj there are at 
least 2 J elements b G A for which condition 2 holds, since 

2 J < N(a) < \B A (a)nA\ . 

In total therefore, 

Q > |A| 2 |A,|2 J ' 

which concludes the proof of (16.41) . 

We now prove f)6.5p . Note that the map 

(a, b, c,d)^-(a + c,b+ c, ad, bd) 

is injective and so it suffices to bound the number of possibilities for this latter 
expression, subject to the constraint that (a, b, c, d) is good. There are certainly 
at most |A + j4| possibilities for a + c and at most \AA\ for ad, so it suffices to 
show that if these are fixed then there are at most O (2 J \A + possibilities 
for b + c and at most O {2 i \AA\/\A j \) for bd. 
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First establish the bound on the number of b + c. Note that if 

a + c = a + c 

then either 

B A (a) + c C B A (a') + d 

or 

S A (a') + c' C S A (a) + c 

since both sets are balls with the same centre a + c. 

As a consequence, if G C A x A is the set of additively good pairs (a, c), then 

G 

for any x G A + A there is a fixed additively good pair (a x , c x ) such that 

54(a) + c C .04 (aj + a, 

whenever a + c = 2 and (a, c) is additively good. Thus if a + c = x is the fixed first 
co-ordinate and b + c is a possible second co-ordinate then since 6 G 5a (a) R A 
and c G A we have 

b + ce(A + A)n (B A (a) + c) 

C {A + A) n (5aK) + c x ) . 

Since (a x , c x ) is additively good, there are, as required, at most O (2^\A + A\/\Aj\) 
possibilities for b+c. The argument that there are at most at most O (2^\AA\/\Aj\) 
for bd is similar. 
In total therefore 

Q < 

which concludes the proof of (16. 5p and thus of the lemma. □ 

The following result shows that any chain contains a large separable subset, 
allowing Lemma 16.41 to be applied to the chain found in Lemma 16.51 

Lemma 6.6. // C is an A-chain then C contains a separable set of cardinality 
at least \C\/q. 

Proof. Observe that any subset {c±, . . . , c„} C C with 

B A {ci) C . . . C B A (c n ). 
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2 2j+4 \ 


A + A\ 2 \ 


AA\ 


2 




A 3 \ 


2 



6.5. Proof of Theorem \6.1\ 



is separable. Indeed, such a set is separated by the balls B A (ci) because if c i+ i 
were an element of B A (ci) it would follow that r A (ci + i) = r A (ci) and so by Lemma 
16.31 we would have the contradiction B A (ci) = B A (ci + i) 

Define an equivalence relation on elements of A by a ~ b if and only if B A (a) = 
B A (b). To prove the lemma it suffices to show that each equivalence class contains 
at most q elements of A. 

Note first that if a ~ b then 

\a - b\ = r A (a) = r A (b). 

Indeed, since B A (a) = B A {b) it follows that b G B A (a) and so \a — b\ < r A (a). 
However by minimality, \a — b\ > r^(a) and so \a — b\ = ^(o). Similarly \a — b\ = 
r A (b). 

Suppose for a contradiction that there is an equivalence class containing el- 
ements di, . . . , a q+ \. Consider differences a\ — di for 2 < i < q + 1. By the last 
paragraph we have 

|ai - ai| = r A (a 1 ) = r A (ai). 

Now look at the leading terms of the a\ — a*. Since the leading term must be 
non-zero, there are only q — 1 possibilities and so by the pigeonhole principle 
there must exist i ^ j such that ai — and ai — ctj have the same leading term. 
Since 

|ai — cij| = \a\ — dj \ = r A (ai) 

it follows that a± — a, and ai — aj have the same degree, and that this is strictly 
greater than the degree of 

cij — cij = (ai — dj) — (ai — a^). 

This yields the contradiction 

rA(ai) = \ai -dj\ < \a x - Oj| = r A {a%) 

and so concludes the proof. □ 



6.5 Proof of Theorem 6.1 



Theorem 16. II now follows by combining Lemma \Q .41 from Section I6\3l with Lemma 
16.51 and Lemma [6.61 from Section I6~4"l 
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Chapter 6. A sum-product theorem in function fields 



Proof of Theorem \6.1[ By Lemma the set A contains an A-chain of cardinal- 
ity 

n 





\A\ ! 




\A + A\ 


21 


AA 


■| 2 log 3 |A| 



By Lemma 16.61 it therefore contains a separable set S of cardinality 

Ul 5 



ft I 

j\A + A\ 2 \AA\ 2 \og 3 \A\ 



and so Lemma 16.41 implies 

\kA\ > \kS\ > 





.41 


5 


q\A + A\ 


2 


AA\ 


2 log 3 


A\ 



Pliinnecke's inequality (Lemma 13. 2p shows that \kA\ <C f° r an y & G N and 

so combining upper and lower bounds on \kA\ gives 



k 

\A + A\ k ^ k ( — ; „ 9 ',7,„ 1 ._3 I „ ) l^l'''" 1 - 

Taking fc-th roots, we get 





4 


5 


+ A| 


2 


1^1 


2 log 3 


A\ 





A 




L 




q\A + A\ 2 \AA\ 


2 log 3 





Letting k tend to infinity and rearranging then yields 

\A + A\ 3 \AA\ 2 > g \A\ 6 -° {1) 

as required. 

□ 



6.6 Further work 

• Incidences and expanders. Now that we have a sum-product estimate 
in F q (t), it is possible to obtain expander results like Theorem 15 . 1 1 and inci- 
dence results like Theorem 14.11 and Theorem 14.21 for function fields without 
much fuss. These will be stronger than the finite field case, but the gap 
versus the ¥ q (t) sum-product estimate will be quite large. 

Is it possible to do better by working directly with the non-archimedean 
geometry of F 9 (t)? For example, can we obtain an incidence bound in F g (t) 2 
that is almost as strong as the Szmereredi- Trotter theorem in M 2 ? 
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6.6. Further work 



• Computer science applications. At the start of the thesis, we said that 
we would not worry about applications. However it is worth mentioning 
that some existing applications of finite field growth results to theoretical 
computer science may be improved by considering ¥ g (t) instead. 

• Other arithmetic combinatorics problems. There are many more 
topics in arithmetic combinatorics than considered in this chapter, or indeed 
in this thesis. Thomas Bloom, with whom the work in this chapter is joint, 
is investigating a number of such problems in F g (i). His upcoming thesis is 
likely to be worth a read. 
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Appendix A 



Pigeonholing 



This appendix summarises some standard pigeonholing results used throughout 
the thesis. 

A.l Averaging 

Averaging results enable us to take information about the average behaviour of 
a set and deduce the existence of elements with particular behaviour. The most 
basic result of this kind shows that at least one element must be at least average, 
and at least one element must be at most average. It follows by elementary 
pigeonholing and so we record it without proof. 

Lemma A.l. Let A be a finite set of real numbers. Then at least one element of 
A must be greater than or equal to r^r YlaeA a an< ^ °^ ^ eas ^ one must be less than 
or equal to ^ J2 a &A a - 

Lemma IA.1I is so standard that it is used throughout the thesis without ref- 
erence. More developed versions of this approach enable one to show that not 
just one element but many must exhibit behaviour not very much different from 
the average. These are constructed in the main body of the thesis to deal with 
particular situations. 
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A. 2 Dyadic pigeonholing 

The phrase 'dyadic pigeonholing' refers to the following result, which at the price 
of a logarithmic factor allows us to assume that a variable is essentially constant. 

Lemma A. 2 (Dyadic pigeonholing). Let A be a finite set of real numbers strictly 
greater than one and less than or equal to a. Then there exists an integer k and 
a subset A' of A such that every element of A' lies in the interval (k, 2k] and 

\ A '\k^^^. 
log a 

Proof. For each integer < j < |~log 2 a] , let 

Aj =/ln(2 J ,2 J+1 ]. 
The sets Aj partition A and so 

r!°g2 Q i 

a&A j=l 



Hence there is a j for which 



\A 3 \2 j > ^ aeA 



log a 

Setting k = 2 j and A' = A 2 j, the proof is complete. □ 

A. 3 The Cauchy-Schwarz inequality 

The phrase 'by Cauchy-Schwarz' typically has two meanings in the literature, and 
this is reflected in the thesis. It should be clear from the context which meaning 
is implied. The first meaning is the standard Cauchy-Schwarz inequality. 

Lemma A. 3 (Cauchy-Schwarz). Let A, B be finite sets of real numbers. Then 

a£A,beB \a£A J \beB J 

The second meaning is a particular application of Cauchy-Schwarz to pairwise 
intersection of sets, as follows. 
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A. 3. The Cauchy-Schwarz inequality 

Corollary A. 4. Let A be a finite set, and suppose we have a collection of subsets 
Ai C A, indexed by a finite set I. Then 



J2\A\<\A\^ felAnA,l) 



1/2 



Proof. We have 

is/ aeA \iel 

Hence by the Cauchy-Schwarz inequality, 



E i^i ^ \ A \ 1/2 ( E E 1 ( a G A >) 1 ( a G A s)j 

iei \aeA i,jei / 

= i^ 1/2 (Ei^ n ^i) • 



1/2 



□ 
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Appendix B 



Projective geometry 



This appendix gives the background in projective geometry necessary for Chap- 
ters H] and [51 

First it defines projective space FF n over a field F. Then it shows how FF n 
can be considered as the union of affine space F n and a hyperplane 'at infinity'. 
Lastly, it defines projective transformations and establishes some useful facts 
about their transitivity. 

The approach is based on that in [39|. 

B.l Projective space 

Let F be a field. Define projective n-space FF n to be 

where ~ is the equivalence relation given by dilation, i.e if x, y G F n+l \ {0} 
then x ~ y if and only if Xx = y for some A 6 F \ {0}. Elements of FF n are 
therefore equivalence classes, and we write [x] for the equivalence class containing 

x e F n+1 \ {o}. 

We will be concerned with linear subspaces of FF n . In affine space F n an 
(n — l)-dimensional hyperplane is the locus of zeroes x = (x\, . . . , x n ) of a linear, 
possibly inhomogeneous, equation in n variables 

aiXi + . . . + a n x n + a n+ i = (B.l) 



123 



Appendix B. Projective geometry 



where the a, are fixed elements of F. 

But in projective space, an (n — l)-dimensional hyperplane is the set of [x] G 
FF n for which x G F n+1 satisfies a linear homogeneous equation in n+1 variables 

aixi + . . . + a n x n + a n+1 x n+1 = 0. (B.2) 



The homogeneity of (IB.2I) ensures that this is well-defined. Note that [x] and [y] 



lie in the same (n — l)-dimensional projective hyperplane of FF n if and only if x 
and y lie in the same n-dimensional affine hyperplane of F n+1 . 



B.2 The hyperplane at infinity 

It is often helpful to think of FF n as the union of F n with an (n — l)-dimensional 
hyperplane 'at infinity'. The idea is that two parallel (n — l)-spaces are disjoint 
in F n , but in FF n they intersect in an (n — 2)-space on the hyperplane at infinity. 
Moreover, all (n — l)-spaces of the same gradient will intersect in the same (n — 2) 
space at infinity. 

For example, the projective line FF 1 can be viewed as the extended line 
F U {oo}. And the projective plane FF 2 can be viewed as F 2 U where is the 
projective line at infinity. In this latter case, two parallel lines in F 2 intersect in 
a point on Z^, and all lines of the same gradient intersect at the same such point. 

To justify this interpretation, view F n+l as F n x F and identify x G F n with 
[x, 1] G FF n . This preserves hyperplanes, since x lies in the affine hyperplane 
given by (IB.lj) if and only if [x, 1] lies in the projective hyperplane given by (IB.2jh 
This accounts for all elements of FF n apart from those of the form [x, 0] with 
x G F n . These form the projective {n — l)-space given by x n+ i = 0, which we call 
the 'hyperplane at infinity'. The verification of the claim that all (n — l)-spaces 
of the same gradient intersect in the same {n — 2)-space is left as an exercise. 

B.3 Projective transformations 

The group PSL n+ i(F) of projective transformations of FF n is defined by 

PSL n+1 (F) = SL n+1 (F)/±I 
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B.3. Projective transformations 



where I is the identity. Elements are therefore equivalence classes [T] of linear 
transformations T G SL n+ i(F). The group has an action on FF n given by 

[T] [x] = [T(x)}. 

It is easy to check that this action is well-defined, that elements of PSL n+ i(F) 
are permutations of FF n , and that they preserve linear subspaces. 

The action also has an important transitivity property. Say that (n + 2) 
distinct points Pi G ¥F n are a frame if no n + 1 of them lie in the same (n — 1)- 
dimensional projective hyperplane. For example, three distinct elements of FF 1 
are a frame, and four distinct elements of FF 2 are a frame if no three of them 
are collinear. The following result shows that the action of PSL n (F) is sharply 
transitive on frames. 

Lemma B.l (Sharp transitivity on frames). Let (pi, . . . ,p n +2) and (qi, . . . , q n +2) 
be two frames of points in ¥F n . There is a unique projective transformation 
r G PSL n+1 (F) such that 

q n+2 ) = (r(pi), • • • , r(p n+2 )). 

Proof. Let with l<i<n + lbe elements of the canonical basis of F n+1 , let 
/. = [ ei ] g FF n and define 

n+1 

6* — ^ ^ Cj 
i=l 

and /* = [e*]. 

It suffices to show that for any frame (pi, . . . ) p n +2) of points in FF n there 
exists a unique r G PSL n+1 (F) that sends to /j for each 1 < i < n + 1 and 
sends p n+ 2 to /*. Indeed, if this is established then given frames (pi, . . . , Pn+2) and 
(qi, . . . , g n+2 ) there are unique r x , r 2 G PSL n+l (F) that send both to (/1, . . . , / n+i , /*). 
Then /x = r^ 1 ^ is the unique map that sends (pi, . . . ,p n +2) to (q±, . . . , ^+2)- 

We first prove the existence of an appropriate r. Say that p, L G PF n is given 
by Pi = [ti] with ti G F n+1 . Since the first n + 1 points Pi are not coplanar in 
FF n , the corresponding U are not coplanar in F n+1 and so form a basis. Hence 
we can write 

n+1 

tn+2 = ^ KU (B.3) 
i=l 
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with the Aj all elements of F. Let T e SL n+ i(F) be a linear transformation that 
sends the F n+1 -basis (Aiti, . . . , A n+ i£ n +i) to a scalar multiple of the canonical 
basis (ei, . . . , e n+ i). Let r — [T]. Then 

r( Pl ) = [T][t t ] = [T(t i )] = [e t ] = f l 

for each 1 < % < n + 1. Additionally (1B.3j) and the choice of T imply that 



T(p, 



n+2j 



[T(t n+2 )] 



i=l 



f* 



and so we have established existence. 

We now prove uniqueness, for which it suffices to show that our choice of 
[T] e PSL n+ i(F) is the only one that sends pi to fi for 1 < i < n + 1 and p n +2 to 
/*. So suppose that [T] has this property Then [T][ti] = [ei] and so there exists 
Hi G F such that 

T (AA) = Hi e i- 

But since [T] [^+2] = [e*] there exists //* such that 

'n+l 



,i=i 



n+l 

/i* ^ &i- 

i=l 



Combining gives 



n+l n+l 
i=l i=l 



and so by linear independence \i% = A** f° r & U m other words, T sends the basis 
(Aiii, . . . , A n+ it n+ i) to a scalar multiple of the canonical basis (ei, . . . , e n+ i), as 
required. □ 
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